Deep Learning RNN Cheat Sheet. Neural networks come in several variants, such as CNNs (Convolutional Neural Networks), RNNs (Recurrent Neural Networks), and autoencoders. RNNs are designed for sequence prediction problems (one-to-many, many-to-many, and many-to-one). An RNN is recurrent because it performs the same task for every element of a sequence, with the output depending on the previous computations. The 2-page cheat sheet gives you a quick overview of the Keras pipeline for deep learning. It shows you how to work with models (definition, training, prediction, fitting, and evaluation) and gives you a visual overview of how to access the various layers in a neural network.


Download the complete set of ultra HD AI cheat sheets. This PDF includes the most comprehensive set of cheat sheets for AI, neural networks, machine learning, deep learning, and data science: cheat sheets covering everything about convolutional neural networks and recurrent neural networks, as well as the deep learning tips and tricks to keep in mind when training a model, all combined into one ultimate set of references to have with you at all times. Shervine Amidi, a graduate student at Stanford, and Afshine Amidi, of MIT and Uber, creators of a recent set of machine learning cheat sheets, have just published a new set of deep learning cheat sheets. These 'VIP cheat sheets' are based on the materials from Stanford's CS 230 (GitHub repo with PDFs available here) and cover convolutional neural networks, recurrent neural networks, and deep learning tips and tricks.

The question that I get the most from new and experienced machine learning engineers is “how can I get higher accuracy?”

Makes a lot of sense since the most valuable part of machine learning for business is often its predictive capabilities. Improving the accuracy of prediction is an easy way to squeeze more value from existing systems.

The guide will be broken up into four different sections with some strategies in each.

  • Data Optimization
  • Algorithm tuning
  • Hyper-Parameter Optimization
  • Ensembles, Ensembles, Ensembles

Not all of these ideas will boost model performance, and you will see limited returns the more of them you apply to the same problem.

Still stuck after trying a few of these? That indicates you should rethink the core solution to your business problem. This article is just a deep learning performance cheat sheet, so I'm linking you to more detailed sources of information in each section.

Data Optimization

Balance your data set

One of the easiest ways to increase performance for an underperforming deep learning model is to balance your dataset, if your problem is classification. Real-world datasets are often skewed, and if you want the best accuracy, you want your deep learning system to learn to pick between the classes based on their characteristics, not by copying the skewed class distribution.

Common methods include the following (a short resampling sketch follows the list):

  • Subsample Majority Class: You can balance the class distributions by subsampling the majority class.
  • Oversample Minority Class: Sampling with replacement can be used to increase your minority class proportion.
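As a minimal sketch, here is how both approaches might look with scikit-learn's `resample` utility; the toy DataFrame and its "label" column are purely illustrative.

```python
import pandas as pd
from sklearn.utils import resample

# Toy, skewed dataset (8 majority examples, 2 minority examples).
df = pd.DataFrame({"feature": range(10),
                   "label": [0] * 8 + [1] * 2})

majority = df[df["label"] == 0]
minority = df[df["label"] == 1]

# Option 1: subsample the majority class down to the minority size.
majority_down = resample(majority, replace=False,
                         n_samples=len(minority), random_state=42)
balanced_down = pd.concat([majority_down, minority])

# Option 2: oversample the minority class with replacement.
minority_up = resample(minority, replace=True,
                       n_samples=len(majority), random_state=42)
balanced_up = pd.concat([majority, minority_up])
```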

More Data


Many of us are familiar with the graph showing the relationship between the amount of data and performance for both deep learning and classical machine learning approaches. If you are not, the lesson is clear and straightforward: if you want better performance from your model, you need more data. Depending on your budget, you might opt for creating more labeled data, or for collecting more unlabeled data and training your feature-extraction sub-model further.

Open Source Labeling Software

Generate More Data

Or fake it till you make it. An often-ignored method of improving accuracy is creating new data from what you already have. Take photos, for example: engineers often create more images by rotating and randomly shifting existing ones. Such transformations also reduce overfitting on the training set.
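A sketch of this idea using Keras' `ImageDataGenerator`; the random arrays stand in for real images, and the transformation ranges are just example values.

```python
import numpy as np
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Placeholder images; in practice these would be your real training set.
x_train = np.random.rand(8, 64, 64, 3)
y_train = np.random.randint(0, 2, size=8)

augmenter = ImageDataGenerator(
    rotation_range=20,       # random rotations up to 20 degrees
    width_shift_range=0.1,   # random horizontal shifts
    height_shift_range=0.1,  # random vertical shifts
    horizontal_flip=True,
)

# Every batch drawn from the generator is a freshly transformed copy of
# the originals, so the model rarely sees exactly the same image twice.
x_batch, y_batch = next(augmenter.flow(x_train, y_train, batch_size=4))
```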

Algorithm Tuning

Copy The Researchers

Are you working on a problem that has lots of research behind it? You are in luck, because hundreds of engineers may have already put a lot of thought into how to get the best accuracy on this problem. Read some research papers on the topic and take note of the different methods they used to get their results! They might even have a GitHub repository of their code for you to sink your teeth into.

Google Scholar is an excellent place to start your search. They offer many tools to help you find related research as well.

For storage and organization of research papers, I use Mendeley.

Algorithm spot check

Don’t let your ego get the best of you. It’s impossible to know in advance which machine learning algorithm will work best for your problem. Whenever I attack a new problem without much research behind it, I look at the available methods and try all of them: deep learning approaches (CNNs, RNNs, etc.) and classical machine learning approaches (random forests, gradient boosting, etc.).

Rank the results of all your experiments and double down on the algorithms that perform the best.
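One way to run such a spot check is with scikit-learn's cross-validation utilities; the candidate models and the synthetic dataset below are only an example lineup.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "random_forest": RandomForestClassifier(),
    "gradient_boosting": GradientBoostingClassifier(),
    "small_neural_net": MLPClassifier(max_iter=1000),
}

# Rank the candidates by mean cross-validated accuracy.
scores = {name: cross_val_score(model, X, y, cv=5).mean()
          for name, model in candidates.items()}
for name, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {score:.3f}")
```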

Hyper-Parameter Optimization

Learning rates


The Adam optimization algorithm is tried and true, often giving excellent results across deep learning problems. Even with its fantastic performance, it can still leave you stuck in a local minimum. An algorithm that keeps the benefits of Adam while reducing the chance of getting stuck in a local minimum is Stochastic Gradient Descent with Warm Restarts.
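One readily available implementation of this idea is Keras' cosine decay with restarts schedule. A minimal sketch, with illustrative step counts and learning rates:

```python
import tensorflow as tf

# Cosine-annealed learning rate that "restarts" at the top of each cycle.
schedule = tf.keras.optimizers.schedules.CosineDecayRestarts(
    initial_learning_rate=0.1,
    first_decay_steps=1000,  # length of the first cycle, in training steps
    t_mul=2.0,               # each subsequent cycle is twice as long
    m_mul=0.8,               # each restart begins at a slightly lower peak
)

optimizer = tf.keras.optimizers.SGD(learning_rate=schedule, momentum=0.9)
# model.compile(optimizer=optimizer, loss="sparse_categorical_crossentropy")
```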

Batch size and number of epochs

The standard procedure in modern deep learning is to use large batch sizes and a large number of epochs, but common strategies yield common results. Experiment with the size of your batches and the number of training epochs.
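A sketch of such an experiment with Keras; `build_model`, `x_train`, and `y_train` are hypothetical stand-ins for your own model factory and data.

```python
# Sweep a few batch sizes and epoch counts, tracking the best validation
# accuracy for each combination.
results = {}
for batch_size in (16, 64, 256):
    for epochs in (10, 50):
        # Hypothetical helper returning a fresh model compiled with an
        # accuracy metric, so each run starts from re-initialized weights.
        model = build_model()
        history = model.fit(x_train, y_train,
                            batch_size=batch_size, epochs=epochs,
                            validation_split=0.2, verbose=0)
        results[(batch_size, epochs)] = max(history.history["val_accuracy"])

print(max(results, key=results.get))  # best (batch_size, epochs) pair
```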

Early Stopping


This is an excellent method for reducing the generalization error of your deep learning system. Continual training might improve accuracy on your data set, but at a certain point, it starts to reduce the model’s accuracy on data not yet seen by the model. To improve real-world performance try early stopping.
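A minimal sketch using the standard Keras callback; the patience value is just an example.

```python
import tensorflow as tf

# Stop training once validation loss has not improved for 5 epochs,
# and roll back to the best weights seen so far.
early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss",
    patience=5,
    restore_best_weights=True,
)

# model.fit(x_train, y_train, validation_split=0.2,
#           epochs=200, callbacks=[early_stop])
```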

Network Architecture

If you want to try something a little more interesting, you can give Efficient Neural Architecture Search (ENAS) a try. This algorithm creates a custom network design that maximizes accuracy on your dataset and is far more efficient than the standard neural architecture search used by cloud ML services.

Regularization

A robust method to stop overfitting is regularization. There are a couple of different forms of regularization that you can try on your deep learning project. If you haven't tried these methods yet, I would start including them in every project you do; a short Keras sketch follows the list.

  • Dropout: Randomly turns off a percentage of neurons during training, which helps prevent groups of neurons from co-adapting and overfitting together.
  • Weight penalty (L1 and L2): Weights that explode in size can be a real problem in deep learning and reduce accuracy. One way to combat this is to add weight decay. These penalties keep the weights in the network as small as possible unless large gradients counteract them. On top of often increasing performance, they make the model easier to interpret.
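A short Keras sketch showing both techniques in one small model; the layer sizes and penalty strength are illustrative.

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

model = tf.keras.Sequential([
    layers.Dense(128, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-4)),  # L2 weight penalty
    layers.Dropout(0.5),  # randomly turn off 50% of units during training
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-4)),
    layers.Dropout(0.5),
    layers.Dense(1, activation="sigmoid"),
])
```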

Ensembles, Ensembles, Ensembles

Having trouble picking the best model to use? Often you can combine the outputs from different models and get better accuracy. There are two steps to each of these algorithms:

  • Producing a distribution of simple ML models on subsets of the original data
  • Combining the distribution into one “Aggregated” model

Combined Models/Views (Bagging)

In this method, you train a few different models, which differ in some way, on the same data, and you average their outputs to create the final output. Bagging has the effect of reducing variance in the model. You can intuitively think of it as having multiple people with different backgrounds thinking about the same problem from different starting positions. Just as on a team, this can be a potent tool for getting the right answer.
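A compact sketch of bagging with scikit-learn: many decision trees are fit on bootstrapped subsets of the same data and their predictions are aggregated. The dataset is synthetic and the estimator count is arbitrary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)

# 50 trees, each trained on a bootstrap sample; predictions are aggregated.
bagged = BaggingClassifier(DecisionTreeClassifier(),
                           n_estimators=50, random_state=0)
print(cross_val_score(bagged, X, y, cv=5).mean())
```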

Stacking


Stacking is similar to bagging; the difference is that you don't have a fixed, empirical formula for the combined output. Instead, you train a meta-level learner that, based on the input data, chooses how to weight the answers from your different models to produce the final output.
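A minimal stacking sketch with scikit-learn, where a logistic regression acts as the meta-level learner on top of two base models; the choice of models is only an example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, random_state=0)

stack = StackingClassifier(
    estimators=[("rf", RandomForestClassifier()),
                ("svm", SVC())],
    final_estimator=LogisticRegression(),  # the meta-level learner
)
print(cross_val_score(stack, X, y, cv=5).mean())
```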

Still having issues?

Reframe Your Problem


Take a break from looking at your screen and get a coffee. This solution is all about rethinking your problem from the beginning. I find it helps to sit down and start brainstorming different ways that you could solve the problem. Maybe start by asking yourself some simple questions:

  • Can my classification problem become a regression problem, or the reverse?
  • Can I break the problem down into smaller pieces?
  • Are there any observations I have made about the data that could change the scope of the problem?
  • Can my binary output become a softmax output, or vice versa?
  • Am I looking at this problem in the most efficient way?

Rethinking your problem can be the hardest of these methods, but it is often the one that yields the best results. It helps to chat with someone who has experience in deep learning and can give you a fresh take on your problem.

If you would like to chat with someone, I am making myself available for the next month for a 30-minute conversation about your project. I'm charging 5 dollars for the 30-minute call as a barrier to keep out those who are not serious. Sign up for a time slot.

Also read: Top 6 Cheat Sheets Novice Machine Learning Engineers Need

Thanks for reading! If you enjoyed the post, share the article with anyone you think needs it. Let's also connect on Twitter or LinkedIn, or follow me on Medium.


Cheat sheets for machine learning are plentiful. Quality, concise technical cheat sheets, on the other hand... not so much. A good set of resources covering theoretical machine learning concepts would be invaluable.


Shervine Amidi, graduate student at Stanford, and Afshine Amidi, of MIT and Uber, have created just such a set of resources. The VIP cheat sheets, as Shervine and Afshine have dubbed them (Github repo with PDFs available here), are structured around covering key top-level topics in Stanford's CS 229 Machine Learning course, including:

  • Notation and general concepts
  • Linear models
  • Classification
  • Clustering
  • Neural networks
  • ... and much more

Links to individual cheat sheets are below:

You can visit Shervine's CS 229 resource page or the Github repo for more information, or can download the cheat sheets from the direct download links above.

You can also find all of the sheets bundled together into a single 'super VIP cheat sheet.'

Thanks to Shervine and Afshine for putting these fantastic resources together.

