7mins read The attention mechanism was introduced to improve the performance of the encoder-decoder model for machine translation. The idea behind the […]
58mins read Information Theory Information theory is a field of study concerned with quantifying information for communication. It is a subfield of mathematics […]
20mins read DenseNet Architecture Introduction In a standard Convolutional Neural Network, we have an input image, that is then passed through the network […]
25mins read Transposed Convolutions is a revolutionary concept for applications like image segmentation, super-resolution, etc but sometimes it becomes a little trickier […]
10mins read What do BERT, RoBERTa, ALBERT, SpanBERT, DistilBERT, SesameBERT, SemBERT, SciBERT, BioBERT, MobileBERT, TinyBERT and CamemBERT all have in common? And […]
30mins read Deep neural networks are trained using the stochastic gradient descent optimization algorithm. As part of the optimization algorithm, the error for […]
10mins read The Idea Behind RNNs Recurrent neural networks in general maintain state information about data previously passed through the network. This […]