Hierarchical clustering is the second most popular technique for clustering after K-means. Remember, in K-means; we need to define the number of clusters beforehand. However, in hierarchical clustering, we don’t have to specify the number of […]
Continue readingMore TagAuthor: Niranjan B Subramanian
Types of PCA
In the era of big data, massive datasets are increasingly common in many disciplines and are often difficult to interpret. In this article, we’ll discuss the principal component analysis which is widely used as a dimensionaity […]
Continue readingMore TagTypes of Sampling
Since it is not always possible to study the entire population, we need to rely on sampling to acquire a segment of the population to perform an experiment. It is also essential to make sure that […]
Continue readingMore TagIntroduction To Linear Regression(Part-2)
In the previous part of the Introduction to Linear Regression, we discussed simple linear regression. Simple linear regression is a basic model with just two variables an independent variable x, and a dependent variable y based […]
Continue readingMore TagOptimal k in K-means
A major challenge in the K-means algorithm is choosing the optimal value of k; however, selecting the right value of k is quite tricky and is also crucial as it can impact the performance of the […]
Continue readingMore TagIntroduction to K-means
K-means clustering is one of the simplest unsupervised learning algorithms that solve the well known clustering problem. Before we venture into K-means, let’s first understand what clustering is? What is clustering? The idea behind clustering is […]
Continue readingMore TagRegression Evaluation Metrics
Once we build our regression model, how can we measure the goodness of fit? We have various regression evaluation metrics to measure how well our model fits the data. In this article, we will see some […]
Continue readingMore TagIntroduction to Linear Regression
Before we venture into linear regression, let’s first try to understand what regression analysis is? What is Regression? Regression is a statistical approach used for predicting real values like the age, weight, salary, for example. In […]
Continue readingMore TagData Science Life Cycle
The data science life cycle consists of 7 phases. In this post, we will go through each of them briefly. The following infographic depicts different phases in the data science life cycle. PROBLEM DEFINITION: This phase […]
Continue readingMore TagPerformance Measure for Classification(Part-2)
INTRODUCTION: In the last part, we see what a confusion matrix is and some other metrics like TPR, TNR, FPR, and FNR, which are based on the confusion matrix. In this post, we will see about […]
Continue readingMore Tag