Some projects done in different stages of my academic career.
On a Nonparametric Test for Dependence
Literature review and discussion on a paper which utilizes two different correlation coefficients to aggregate them into one and come up with a distribution free test for independence betweeen two random variables.
A Study on Non Gaussian Graphical Models
We first try to understand what are graphical models and why are they needed. Then we briefly discuss about the existing literature on one of the simplest cases which is when the nodes of the graph jointly follow a multivariate normal distribution. Next, we move on to study what methods can be applied when the data is not normally distributed, which is often the case in practice. To study non gaussian scenarios, we do a comprehensive literature review on nonparametric methods for continuous data and also study discrete graphical models. At last, we try to see if we can have a common framework to study both discrete and continuous variables together. In most of the setups, we try to study the performance of the methods in different situations like for large sample size, for deviation from normality and for higher dimensions.
On a Robust Correlation Coefficient
Review and discussion on a paper published in JRSSD which proposes a robust version of the usual correlation coefficient based on LMS regression. Simulations and applications on real data are demonstrated.
A Study on Kernel Density Estimation
Starting with the simplest form of density estimation i.e. histograms, identifying their drawbacks and step by step trying to overcome them, we eventually illustrate KDE.Simulations from different distributions and taking different choices of kernels show how good the estimates holds.Asymptotic properties are verified.Using a similar approach via kernels, we try to estimate the CDF.Detailed simulations done for each case.Finally we do a case study.
Regression Analysis of Petrol Consumption Data
Starting with the linear regression model with all predictors involved, performed regression diagnostics, did appropriate transformations to the data to remove unwanted stuff like heteroscedasticity,multicollinearity,non-normality of errors,etc.Also,variable selection was performed.Finally presented the model which is the best fit to the data.