Hierarchical clustering is the second most popular technique for clustering after K-means. Remember, in K-means; we need to define the number of clusters beforehand. However, in hierarchical clustering, we don’t have to specify the number of […]

Continue readingMore Tag## Types of PCA

In the era of big data, massive datasets are increasingly common in many disciplines and are often difficult to interpret. In this article, we’ll discuss the principal component analysis which is widely used as a dimensionaity […]

Continue readingMore Tag## Types of Sampling

Since it is not always possible to study the entire population, we need to rely on sampling to acquire a segment of the population to perform an experiment. It is also essential to make sure that […]

Continue readingMore Tag## Introduction To Linear Regression(Part-2)

In the previous part of the Introduction to Linear Regression, we discussed simple linear regression. Simple linear regression is a basic model with just two variables an independent variable x, and a dependent variable y based […]

Continue readingMore Tag## Optimal k in K-means

A major challenge in the K-means algorithm is choosing the optimal value of k; however, selecting the right value of k is quite tricky and is also crucial as it can impact the performance of the […]

Continue readingMore Tag## Introduction to K-means

K-means clustering is one of the simplest unsupervised learning algorithms that solve the well known clustering problem. Before we venture into K-means, let’s first understand what clustering is? What is clustering? The idea behind clustering is […]

Continue readingMore Tag## Regression Evaluation Metrics

Once we build our regression model, how can we measure the goodness of fit? We have various regression evaluation metrics to measure how well our model fits the data. In this article, we will see some […]

Continue readingMore Tag## Introduction to Linear Regression

Before we venture into linear regression, let’s first try to understand what regression analysis is? What is Regression? Regression is a statistical approach used for predicting real values like the age, weight, salary, for example. In […]

Continue readingMore Tag## Data Science Life Cycle

The data science life cycle consists of 7 phases. In this post, we will go through each of them briefly. The following infographic depicts different phases in the data science life cycle. PROBLEM DEFINITION: This phase […]

Continue readingMore Tag## Performance Measure for Classification(Part-2)

INTRODUCTION: In the last part, we see what a confusion matrix is and some other metrics like TPR, TNR, FPR, and FNR, which are based on the confusion matrix. In this post, we will see about […]

Continue readingMore Tag