How do you analyze data for a dissertation? The following illustrative example walks through the process.

Illustrative example: Predicting car prices

Let us assume that the purpose of the statistical analysis is to identify which factors affect the price of a car, so that we can predict it.

In the following table you can see a sample of the data that we will use:

First, we present the data graphically. The graph below shows the percentage of each fuel type in the data.
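The percentages behind such a graph can be computed with a few lines of code. The following is a minimal Python sketch; the list `fuel_types` and the helper `fuel_shares` are hypothetical and are not taken from the data set used in this example.

```python
from collections import Counter

# Hypothetical fuel types for eight cars (NOT the data from this example).
fuel_types = ["petrol", "diesel", "petrol", "petrol",
              "diesel", "hybrid", "petrol", "diesel"]


def fuel_shares(values):
    """Return the percentage share of each category in `values`."""
    counts = Counter(values)
    total = len(values)
    return {fuel: 100 * n / total for fuel, n in counts.items()}


shares = fuel_shares(fuel_types)
for fuel, pct in sorted(shares.items()):
    print(f"{fuel}: {pct:.1f} %")
```

These shares are the values that a pie chart or bar chart of fuel types would display.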

To test if the factors above affect price we use linear regression. It describes the linear relationship between variables X and Y. Mathematically, we can write this relationship as:

$$y_i = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \dots + \beta_k x_{ik} + u_i$$

where $y_i$ is the dependent variable, $x_{i1}, \dots, x_{ik}$ are the independent variables, $\beta_0, \dots, \beta_k$ are the regression parameters and $u_i$ is the error term. We focus on the estimates of the regression parameters, which show whether there is an effect on the dependent variable or not. If the p-value of a parameter is higher than 0.05, the factor is not statistically significant at the 5 % level, so the data give no evidence that it affects the price of the car.
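To make the role of the p-value concrete, here is a minimal Python sketch of a regression with a single independent variable. It is a simplification: the example in this article uses several factors, while the sketch uses only mileage. The data and the helper names (`t_pdf`, `two_sided_p`, `simple_ols`) are hypothetical, and the p-value is obtained by numerically integrating the density of Student's t distribution rather than by calling a statistics package.

```python
import math


def t_pdf(t, df):
    """Density of Student's t distribution with `df` degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + t * t / df) ** (-(df + 1) / 2)


def two_sided_p(t_stat, df, steps=2000):
    """Two-sided p-value via Simpson's rule (`steps` must be even)."""
    upper = abs(t_stat)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * t_pdf(k * h, df)
    area = total * h / 3  # approximates P(0 < T < |t|)
    return max(0.0, 1 - 2 * area)


def simple_ols(x, y):
    """Least-squares fit of y = intercept + slope * x, with the slope's p-value."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    se_slope = math.sqrt(ss_res / (n - 2) / sxx)  # standard error of the slope
    return slope, intercept, two_sided_p(slope / se_slope, n - 2)


# Hypothetical data: mileage in thousands of km, price in thousands.
mileage = [20, 40, 60, 80, 100, 120]
price = [18.5, 16.8, 15.4, 13.1, 12.2, 10.0]

slope, intercept, p_value = simple_ols(mileage, price)
print(f"slope = {slope:.4f}, intercept = {intercept:.4f}")
print("significant at the 5 % level" if p_value < 0.05 else "not significant")
```

For a real dissertation a statistics package would normally report these values directly; the sketch only shows where the numbers come from.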

The table below shows the basic regression statistics. R-Square is 0.987198, which means that almost 99 % of the variation in the dependent variable (Price) is explained by the independent variables (Year, Mileage, No. of doors).
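R-Square compares the variation left unexplained by the model with the total variation of the dependent variable. The Python sketch below illustrates the calculation on hypothetical data with one independent variable; the numbers and the helper names `fit_line` and `r_squared` are assumptions for illustration and do not reproduce the value reported above.

```python
def fit_line(x, y):
    """Least-squares intercept and slope for a single predictor."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
             / sum((a - mean_x) ** 2 for a in x))
    return mean_y - slope * mean_x, slope


def r_squared(observed, fitted):
    """1 minus (residual sum of squares / total sum of squares)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - f) ** 2 for o, f in zip(observed, fitted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot


# Hypothetical data: mileage in thousands of km, price in thousands.
mileage = [20, 40, 60, 80, 100, 120]
price = [18.5, 16.8, 15.4, 13.1, 12.2, 10.0]

intercept, slope = fit_line(mileage, price)
fitted = [intercept + slope * m for m in mileage]
r2 = r_squared(price, fitted)
print(f"R-Square: {r2:.4f}")
```

A value close to 1 means the fitted line leaves little of the variation in price unexplained.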

The p-value of Mileage is lower than 0.05, which means that this factor is statistically significant and does affect the price of the car.

We can see that there is a negative linear relationship between price and mileage. The higher the mileage, the lower the price.



Dissertation data analysis forms a crucial part of the entire dissertation. If the data analysis is not correct, then the whole research project will fail. That is the main reason you need an excellent statistical consultant on your side.

