## Finding Relationship Between Variables – ANOVA (Part 1)

ANOVA Test in Python Finding the relationship between variables is a very important step in any statistical modeling. For example, if you are working with a dataset that contains hundreds of variables but very few observations, you cannot simply include all of those variables in your model. Otherwise you will be violating…
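As a rough illustration of the kind of test the post's title refers to, the sketch below computes a one-way ANOVA F statistic by hand. It is not taken from the post: the function name `one_way_anova_f` and the sample data are made up for illustration, and in practice a library routine such as `scipy.stats.f_oneway` would normally be used instead.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of numeric samples.

    Hypothetical helper for illustration; assumes at least two groups and
    more observations in total than there are groups.
    """
    k = len(groups)                          # number of groups
    n_total = sum(len(g) for g in groups)    # total number of observations
    grand_mean = mean(x for g in groups for x in g)

    # Variation of the group means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of the observations around their own group mean
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within


# Made-up measurements for three groups (illustrative data only)
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
print(f_stat)
```

A large F value suggests that the group means differ by more than the within-group noise would explain. Turning it into a p-value requires the F distribution with `k - 1` and `n_total - k` degrees of freedom, which this sketch leaves out.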

## Data Visualisation in R (Part-3)

Introduction In this report I will plot some more advanced charts using the ggplot2 package. If you want to learn more about some basic plots, you can refer to my earlier articles Data Visualization in R (Part 1) and Data Visualization in R (Part 2). library(Hmisc) library(dplyr) library(ggplot2) library(ggplot2movies) library(RColorBrewer) library(PerformanceAnalytics) library(GGally) Boxplots and Variable Transformation…

## Geospatial Analytics for Boosting Sales

Geospatial analytics The power of big data analytics has been widely acknowledged by decision makers and analysts worldwide. Still, analysts have not used big data to its full potential, especially location data. Location data, also known as geospatial data or geographical information, has been on the rise ever since the advancement of technology. The…

## Introduction to Linear Discriminant Analysis

Linear Discriminant Analysis, most commonly known as ‘LDA’, is one of the most interesting machine learning techniques to date. The idea was first proposed by Dr. Ronald Fisher to classify two classes using ‘Fisher’s linear discriminant’, and it was later generalized to multiple classes as well. In the case of a binary class problem, LDA acts as a classifier like…

## Mumbai Local Railway Map in Tableau

About Tableau Tableau is one of the most popular tools for data visualization. Here I am going to use Tableau to plot the railway and metro network in Mumbai city. The objective of this article is to show a step-by-step method to plot any type of network, such as a rail or road network, on a map using Tableau…

## Data Visualization in R (Part-2)

Introduction In this report, I will plot some more advanced charts using the ggplot2 package. If you want to learn more about some basic plots, you can refer to my earlier article Data Visualization in R (Part 1). Also, you can view other posts related to visualizations here. library(ggplot2) library(RColorBrewer) Data Smoothing in plots Smoothing means using algorithms to remove noise…

## Branches of Statistics and Types of Data

Understanding the world of Statistics This is the first post in this category, where we will go deeper into the magical world of statistics. Without a knowledge of statistics, analytics is incomplete. In fact, anyone aspiring to be a data analyst must first learn statistics before moving on to more advanced topics such as machine learning. For this introductory post, I…

## Data Visualization-R (Part-1)

Introduction In this report, I will plot different datasets using the ggplot2 package to gain some meaningful insights. There is one more post which explains how to visualize maps in R using the ggmap package; you can read more about it here. This post will cover the basics of data visualisation in R. Some basic plots First load the…