
This article walks through implementing deep learning in R on the CIFAR-10 dataset: we will train a Convolutional Neural Network (CNN) model to classify 10,000 test images across 10 classes, using the Keras and TensorFlow packages.
For details on the CIFAR-10 dataset, see https://www.cs.toronto.edu/~kriz/cifar.html
But first, let’s talk a bit about R and Python —
There has always been tough competition between R and Python when it comes to data science and machine learning. R has always been the statistician's choice: it was developed by statisticians Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand. The project was conceived in 1992, with an initial version released in 1995 and a stable beta version in 2000. It is a descendant of the S programming language.
We all know that R has always led in statistical computing and statistical analysis of data because of its built-in packages and functions, which make both descriptive and inferential analysis very easy. And how can we forget the world-famous 'ggplot2' graphics package for making a variety of state-of-the-art plots and visualizations, and my personal favourite, the 'dplyr' package for data transformation and manipulation, which also supports SQL-like syntax and functions.
P.S.: I am personally an R lover.
R and its libraries implement a wide variety of statistical and graphical techniques, including linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, and others. R is easily extensible through functions and extensions, and the R community is noted for its active contributions in terms of packages. Many of R’s standard functions are written in R itself, which makes it easy for users to follow the algorithmic choices made.
Advanced users can easily write C, C++ and Python code to manipulate R objects directly.
But with the data explosion of the past few years, deep learning gained enormous importance, thanks to advances in computational power and our ability to process, manage and store big, complex datasets easily and cheaply. Since then, Python has led in implementing deep learning through its famous libraries such as Keras and TensorFlow. The balance shifted towards Python because it had an enormous list of deep learning libraries and frameworks which R lacked.
Python was leading the deep learning world in every way it could, because with R it was almost impossible to run big, complex deep learning models. But not anymore: R is back in the fight.
With the recent launch of the Keras library in R, with TensorFlow (CPU and GPU compatible) as the backend, R is back in the competition and will again fight Python for the podium, even in the deep learning world. For aspiring data scientists like me who know only R, it is a big relief and an advantage to be able to implement deep learning in R itself.
NOTE:
Firstly, if you are not familiar with the basics of the R language, I urge you to complete DataCamp's amazing course on R programming; it is the course from which I started my data science journey using R. Trust me, these courses are worth your time and money. If you want to learn complete data science fundamentals from scratch using R, covering statistics and probability, data visualization, data exploration, data modelling, machine learning and other important concepts, I strongly recommend the foundation course Data Science with R; I am currently enrolled in it and already loving it.
Also, I have noticed that DataCamp is having a sale (75% off) on all courses, so this would literally be the best time to grab a yearly subscription (which I have), giving unlimited access to all the courses and other material on DataCamp. Make fruitful use of your time sitting at home during this pandemic, and come out of it stronger and more skilled. Happy learning, folks.
Anomaly Detection in R is another amazing course which I loved doing. Joining Data in R is a course I did early on when learning R for data analysis, and it helped me a lot in developing data pre-processing skills.
If learning machine learning from scratch is what you want, then one of the best courses for understanding the basics of machine learning in R is Machine Learning Toolbox. There is also a course on predictive analysis in R for networked data and network analysis, which I recently started and am loving; it is excellent for understanding the concepts of network analysis, something new in the market and really interesting.
For all the Python lovers, I am also attaching some lovely, hand-picked courses on learning and implementing deep learning fundamentals in Python: Deep Learning in Python using Keras, Building Chatbots in Python, and NLP Fundamentals in Python using NLTK would be the best choices for Python users interested in deep learning. Give these a try based on your interests.
If you are more interested in learning machine learning fundamentals using Python, then the best course to start with is Supervised Learning in Python using scikit-learn.
P.S.: The above are some personally hand-picked courses which are among the best out there for building a good foundation in data science. I really believe DataCamp offers the best data science tutorials and courses; it is where I started my journey into data science and analytics, and trust me, the courses are worth your time and money. Do try them out; you will love them. People write articles and tutorials all the time but rarely mention where they learned those skills, so I wanted to add a side note on the resources which helped me a lot to become what I am today.
Now let’s start off with the implementation in R —
Installing the Dependencies —
First of all, we need to install the keras package for R from GitHub, which also involves installing the 'reticulate' package (the R interface to Python) and then the 'tensorflow' package.
#Installing Keras and Tensorflow at the Backend
#You need to install Rtools for installing the 'reticulate' package in order to use the above packages in R.
#The reticulate package provides an R interface to Python modules, classes, and functions
#You can install Rtools for your R version here —
#https://cran.r-project.org/bin/windows/Rtools/
#installing 'devtools' package for installing Packages from github
install.packages('devtools')
#installing keras
devtools::install_github("rstudio/keras")
The above code installs the keras library from GitHub; now we need to load the keras package.
#Loading the keras package
library(keras)
#The R interface to Keras uses TensorFlow as its underlying
#computation engine, so we need to install the TensorFlow engine.
Now we need to install the TensorFlow engine for R. By default, install_tensorflow() installs the CPU version of TensorFlow. Use the command below to download it.
install_tensorflow()
This will download and install the CPU version of TensorFlow, which will run all the backend computations on the CPU.

To install the GPU version of TensorFlow:
install_tensorflow(gpu = T)
For more details on how to install TensorFlow in R, refer to the tensorflow R package documentation.
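Before going further, it is worth verifying that the installation worked. Here is a quick sanity check, a minimal sketch using helper functions from the keras and tensorflow R packages:
#checking that Keras and TensorFlow are available in R
library(keras)
is_keras_available() #should return TRUE if the setup succeeded
library(tensorflow)
tf_config() #prints the Python and TensorFlow configuration in use
If is_keras_available() returns FALSE, re-run install_tensorflow() and restart your R session before trying again.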
Different types of models that can be built in R using the keras package —
Below is the list of different Neural Network models that can be built in R using Keras.
- Multi-Layer Perceptrons
- Convolutional Neural Networks
- Recurrent Neural Networks
- Skip-Gram Models
- Use pre-trained models like VGG16, ResNet etc. (a short sketch follows this list)
- Fine-tune the pre-trained models.
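As a quick taste of the pre-trained-model route, here is a minimal sketch of loading VGG16 with ImageNet weights through the keras package (application_vgg16() and freeze_weights() are part of the package; the weights are downloaded on first use, and 32x32x3 is the smallest input shape VGG16 accepts without its top layers):
#loading VGG16 with ImageNet weights, without its classification head
vgg<-application_vgg16(weights = "imagenet", include_top = FALSE,
                       input_shape = c(32,32,3))
#freezing the convolutional base is the usual first step when
#fine-tuning a pre-trained model on a new dataset
freeze_weights(vgg)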
Let us start by building a very simple CNN model and trying to classify the 10,000 32x32 CIFAR-10 test images.
#loading keras library
library(keras)
#loading the keras inbuilt cifar10 dataset
?dataset_cifar10 #to see the help file for details of dataset
cifar<-dataset_cifar10()
It might take some time to download the dataset, as it is roughly 160 MB. After loading the dataset, let's separate the training and test data.
#TRAINING DATA
train_x<-cifar$train$x/255
#converting the target variable (a vector of class labels) to a
#binary class matrix, i.e. one-hot encoded vectors, using the
#keras built-in function to_categorical()
train_y<-to_categorical(cifar$train$y,num_classes = 10)
#TEST DATA
test_x<-cifar$test$x/255
test_y<-to_categorical(cifar$test$y,num_classes=10)
#checking the dimensions
dim(train_x)
cat("No of training samples\t",dim(train_x)[[1]],"\tNo of test samples\t",dim(test_x)[[1]])
Now let’s define and configure our CNN model’s Architecture —
#a linear stack of layers
model<-keras_model_sequential()
#configuring the Model
model %>%
  #defining a 2-D convolution layer
  layer_conv_2d(filter=32, kernel_size=c(3,3), padding="same",
                input_shape=c(32,32,3)) %>%
  layer_activation("relu") %>%
  #another 2-D convolution layer
  layer_conv_2d(filter=32, kernel_size=c(3,3)) %>%
  layer_activation("relu") %>%
  #defining a pooling layer, which reduces the dimensions of the
  #feature maps and the computational complexity of the model
  layer_max_pooling_2d(pool_size=c(2,2)) %>%
  #dropout layer to avoid overfitting
  layer_dropout(0.25) %>%
  layer_conv_2d(filter=32, kernel_size=c(3,3), padding="same") %>%
  layer_activation("relu") %>%
  layer_conv_2d(filter=32, kernel_size=c(3,3)) %>%
  layer_activation("relu") %>%
  layer_max_pooling_2d(pool_size=c(2,2)) %>%
  layer_dropout(0.25) %>%
  #flatten the input into a 1-D vector
  layer_flatten() %>%
  layer_dense(512) %>%
  layer_activation("relu") %>%
  layer_dropout(0.5) %>%
  #output layer: 10 classes, 10 units
  layer_dense(10) %>%
  #applying the softmax activation to the output layer to turn the
  #logits into class probabilities
  layer_activation("softmax")
Now, after defining the architecture of our CNN, we need to compile the model and specify the loss function and the optimizer that will perform the parameter updates.
#Model's Optimizer
#defining the type of optimizer-ADAM-Adaptive Momentum Estimation
opt<-optimizer_adam( lr= 0.0001 , decay = 1e-6 )
#lr-learning rate , decay - learning rate decay over each update
model %>%
compile(loss="categorical_crossentropy",
optimizer=opt,metrics = "accuracy")
#Summary of the Model and its Architecture
summary(model)
Now, after all this, it's time to train our model on the images.
#TRAINING PROCESS OF THE MODEL
data_augmentation <- TRUE
if(!data_augmentation) {
  #training on the raw images, without augmentation
  model %>% fit(train_x, train_y, batch_size = 32,
                epochs = 80, validation_data = list(test_x, test_y),
                shuffle = TRUE)
} else {
  #generating augmented images
  gen_images <- image_data_generator(featurewise_center = TRUE,
                                     featurewise_std_normalization = TRUE,
                                     rotation_range = 20,
                                     width_shift_range = 0.30,
                                     height_shift_range = 0.30,
                                     horizontal_flip = TRUE)
  #fit the image data generator's internal statistics to the training data
  gen_images %>% fit_image_data_generator(train_x)
  #generates batches of augmented/normalized data from the image data
  #and labels, and trains the model on them
  model %>% fit_generator(
    flow_images_from_data(train_x, train_y, gen_images,
                          batch_size = 32,
                          save_to_dir = "F:/PROJECTS/CNNcifarimages/"),
    steps_per_epoch = as.integer(50000/32), epochs = 80,
    validation_data = list(test_x, test_y))
}
#use the save_to_dir argument to specify the directory in which to save
#the images generated by the model, so you can visually check the
#model's output and its ability to classify images
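Once training finishes, you can score the model on the held-out test set and generate predictions. A short sketch using the keras package's evaluate() and predict_classes() helpers:
#evaluating the trained model on the 10,000 test images
scores<-model %>% evaluate(test_x, test_y)
cat("Test loss:", scores[[1]], "\tTest accuracy:", scores[[2]], "\n")
#predicting integer class labels (0 to 9) for the test images
predictions<-model %>% predict_classes(test_x)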
The above model gave me an accuracy of 86.67% on the validation set after hours of training. It took about 3 to 4 hours for the model to train on my i5 notebook with 8 GB of RAM and a 64-bit 2.63 GHz processor.
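Given the hours of training involved, it is worth persisting the fitted model so you do not have to retrain it every time. A minimal sketch using the package's save_model_hdf5(); the file name here is just an example:
#saving the trained model (architecture + weights) to disk
save_model_hdf5(model, "cifar10_cnn_model.h5")
#reloading it later:
#model<-load_model_hdf5("cifar10_cnn_model.h5")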
You can reduce or increase the number of epochs (iterations over the training data) or adjust the model's other hyperparameters to converge faster and achieve different results, depending on your computational power.
This was literally my first deep learning model in R, and trust me, there was a constant smile of excitement on my face while watching the model train and run.
If this was your first deep learning model in R, like mine, I hope you enjoyed it. With simple code, we were able to classify images with ~87% accuracy.
If you have already used the keras deep learning library in Python, you will find the syntax and structure of the keras library in R very similar to Python's.
Under the hood, the keras package in R creates a conda virtual environment and installs everything required to run Keras in that environment. I am very excited to see data scientists building real-life deep learning models in R.
Developers and software engineers are still working on the keras and tensorflow packages in R, constantly improving them by adding new features and fixing issues. For more details on the R interface to Keras, visit https://github.com/rstudio/keras and spend some time reading the documentation about the R functions used in the various deep learning models.
P.S.: I had some issues while setting up and installing the above deep learning packages in R due to some outdated R packages which I had to update, so be patient while installing and running the deep learning models.
If you encounter errors or have any doubts while running the above code, feel free to comment or contact me at anishsingh.walia2015@vit.ac.in.
Do also check out another deep learning model I implemented, on the MNIST dataset: digit recognition in R using a simple multi-layer perceptron model, available at github.com/anishsingh20/Deep-Learning-in-R-using-Keras-and-Tensorflow-
I hope you liked this post and that it is enough to motivate you to implement deep learning in R on your own.
Do like and share the post, and follow me on GitHub (anishsingh20, Anish Singh Walia) for more updates.
