MXNet features fast implementations of many state-of-the-art models reported in the academic literature. This Model Zoo is an ongoing project to collect complete models, with python scripts, pre-trained weights as well as instructions on how to build and fine tune these models.
The Model Zoo has good entries for CNNs but is seeking content in other areas.
Issue a Pull Request containing the following:
Readme file should contain:
Convolutional neural networks are the state-of-art architecture for many image and video processing problems. Some available datasets include:
For instructions on using these models, see the python tutorial on using pre-trained ImageNet models.
Model Definition | Dataset | Model Weights | Research Basis | Contributors |
---|---|---|---|---|
CaffeNet | ImageNet | Param File | Krizhevsky, 2012 | @jspisak |
Network in Network (NiN) | ImageNet | Param File | Lin et al.., 2014 | @jspisak |
SqueezeNet v1.1 | ImageNet | Param File | Iandola et al.., 2016 | @jspisak |
VGG16 | ImageNet | Param File | Simonyan et al.., 2015 | @jspisak |
VGG19 | ImageNet | Param File | Simonyan et al.., 2015 | @jspisak |
Inception w/ BatchNorm | ImageNet | Param File | Szegedy et al.., 2015 | @jspisak |
ResidualNet152 | ImageNet | Param File | He et al.., 2015 | @jspisak |
ResNext101-64x4d | ImageNet | Param File | Xie et al.., 2016 | @Jerryzcn |
Fast-RCNN | PASCAL VOC | [Param File] | Girshick, 2015 | |
Faster-RCNN | PASCAL VOC | [Param File] | Ren et al..,2016 | |
Single Shot Detection (SSD) | PASCAL VOC | [Param File] | Liu et al.., 2016 | |
LocationNet | MultimediaCommons | Param File | Weyand et al.., 2016 | @jychoi84 @kevinli7 |
MXNet supports many types of recurrent neural networks (RNNs), including Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU) networks. Some available datasets include:
Model Definition | Dataset | Model Weights | Research Basis | Contributors |
---|---|---|---|---|
LSTM - Image Captioning | Flickr8k, MS COCO | Vinyals et al.., 2015 | @... | |
LSTM - Q&A System | bAbl | Weston et al.., 2015 | ||
LSTM - Sentiment Analysis | IMDB | Li et al.., 2015 |
Generative Adversarial Networks train a competing pair of neural networks: a generator network which transforms a latent vector into content like an image, and a discriminator network that tries to distinguish between generated content and supplied "real" training content. When properly trained the two achieve a Nash equilibrium.
Model Definition | Dataset | Model Weights | Research Basis | Contributors |
---|---|---|---|---|
DCGANs | ImageNet | Radford et al..,2016 | @... | |
Text to Image Synthesis | MS COCO | Reed et al.., 2016 | ||
Deep Jazz | Deepjazz.io |
MXNet Supports a variety of model types beyond the canonical CNN and LSTM model types. These include deep reinforcement learning, linear models, etc.. Some available datasets and sources include:
Model Definition | Dataset | Model Weights | Research Basis | Contributors |
---|---|---|---|---|
Word2Vec | Google News | Mikolov et al.., 2013 | @... | |
Matrix Factorization | MovieLens 20M | Huang et al.., 2013 | ||
Deep Q-Network | Atari video games | Minh et al.., 2015 | ||
Asynchronous advantage actor-critic (A3C) | Atari video games | Minh et al.., 2016 |
Can you improve this documentation? These fine people already did:
jspisak, Sandeep Krishnamurthy, Mu Li, Lou, jychoi84, Yao Wang, Sheng Zha, Jerry Zhang, Alex Li & Leo DiracEdit on GitHub
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close