Introduction to Linear Regression

Before we venture into linear regression, let’s first try to understand what regression analysis is? What is Regression? Regression is a statistical approach used for predicting real values like the age, weight, salary, for example. In […]

Data Science Life Cycle

The data science life cycle consists of 7 phases. In this post, we will go through each of them briefly. The following infographic depicts different phases in the data science life cycle. PROBLEM DEFINITION: This phase […]

Performance Measure for Classification(Part-2)

INTRODUCTION: In the last part, we see what a confusion matrix is and some other metrics like TPR, TNR, FPR, and FNR, which are based on the confusion matrix. In this post, we will see about […]

Performance Measures for Classification

Why do we need performance measures? Why do we need performance measures at all? After we developed our classification model, we need to asses the performance of the model. Obviously, we can use accuracy as a […]

Dropout for Regularization

INTRODUCTION: When we have deep neural networks, the biggest problem is overfitting. We can say a neural network overfits when it has excellent performance in the training data and has a poor performance in unseen data. […]

Introduction to activation functions

Why do we need activation functions? We use activation function to introduce non-linearity into the neural network. Activation functions convert independent variables of near infinite range into simple probabilities between 0 and 1. Now the question […]

Introduction to Principal Component Analysis(PCA)

Principal Component Analysis is an unsupervised dimensionality reduction technique, which is extensively used in machine learning. It helps us to alleviate the problem of the curse of dimensionality by reducing the dimension of the data. PCA […]

Introduction to T-SNE with implementation in python

INTRODUCTION to T – SNE: T-SNE is a non-linear dimensionality reduction technique used to visualize high-dimensional data in two or more dimensions. Unlike PCA which preserves only the global structure of the data T-SNE preserves both the local […]

Distance/Similarity Measures in Machine Learning

INTRODUCTION: For algorithms like the k-nearest neighbor and k-means, it is essential to measure the distance between the data points. In KNN we calculate the distance between points to find the nearest neighbor, and in K-Means […]