For my class project, I used the data from the 2019 7th World Happiness Report. Each year the report focuses on some themes of well being and happiness. This year the report was on happiness and the community and looked at the link between government actions and happiness, happiness and community, and happiness and technology. Their study focused on six explanatory factors: social support, healthy life expectancy, GDP per capita, generosity, freedom to make life choices, and freedom from corruption. I chose this topic as, during the midst of the COVID019 pandemic, mindfulness and wellness have been common themes in the media.
In this study, the Cantril Ladder was used to measure overall well-being. The Cantil Ladder is often compared to a 10 rung step ladder that we may have in our tool shed, with the first rung being a score of 1 and the 10th rung rated as a score of 10. Features included in the study: GDP per capita, social support, healthy life expectancy, freedom, generosity, and absence of corruption.
The study uses the variables to explain the variation of happiness across countries. For my project, I focused on the impact of social support, healthy life expectancy, GDP, and generosity on happiness. I also analyzed the factors contributing to a healthy life expectancy.
Two questions I have are:
My data was rank-ordered from 1 to 156, so I found no outliers. In my initial analysis, I only considered four features: social support, generosity, GDP per capita, and healthy life expectancy. 12 null values were detected, and I chose to delete the rows with null values. The table below shows what I found during my initial look at the data.
Using the features that I had selected initially, social support, generosity GDP per capita, and healthy life expectancy, I plotted the data for a visual representation of the relationships. Using KNIME for the analysis tool, I found that three of the four features had similar scatter plots so I could not tell which variable was more dominant. I then turned to a linear regression model, a formal supervised machine learning technique. R2 is a statistical measure of fit that indicates how much variation of a dependent variable is explained by the independent variable(s) in a regression model. As shown on the next page, the inclusion of all features in the original study provided the highest R2, meaning that the inclusion of all features was needed in the model to optimize the likelihood of happiness. R2 never reached 1, meaning that some other factor was missing from the model or that despite the inclusion of all the variables, there was no guarantee that people would be happy despite the most optimal conditions.
The scatterplot (left) of the four features shows social support is the most influential feature of happiness. This combination of features yielded an adjusted R2 of 0.7816.
I experimented with the features used in the study to find the “optimal” combination to find happiness. After trial-and-error, I included the six features in the original study plus what the author referred to as the positive effect and the negative effect. Per the authors of the original study, “Positive effect comprises the average frequency of happiness, laughter, and enjoyment on the previous day, and negative affect comprises the average frequency of worry, sadness, and anger on the previous day. The affect measures thus lie between 0 and 1.” The negative effect lowers the overall score by .08 but considered it necessary as we were including the positive effect. The additional features raised the R2 value by .0328. See below.
The scatterplot (left) of the original six features as well as the positive and negative affect were used to assess how the features contribute to happiness. This combination of features yielded an adjusted R2 of 0.816.
During the pandemic, many people are living in social near-isolation if they are without some tools such as Zoom and Facetime. My initial thought was that the lack of social support would have an adverse impact on a healthy life expectancy. The data below shows that GDP per capita has the greatest influence on a healthy life expectancy. In this case, we have an adjusted R2 of 0.7289 with social support, generosity, and GDP per capita factored into the equation.
The scatterplot (left) of the original six features as well as the positive and negative affect were used t to assess how the features contribute to happiness. This combination of features yielded an adjusted R2 of 0.816.
Again, trying to find the optimal combination of features to predict a healthy life expectancy, I added freedom and lack of government corruption to the equation and raised the adjusted R2 to 0.7708. GDP per capita continues to be a leading feature in determining the likelihood of a healthy life expectancy.
The scatterplot (left) of the original six features as well as the positive and negative affect were used t to assess how the features contribute to happiness. This combination of features yielded an adjusted R2 of 0.816.
I chose to use cluster analysis for unsupervised machine learning. For my first question, what features most influence happiness, I would include in my analysis three of the four features I identified in my initial list of features: social support, healthy life expectancy, and GDP per capita. I would cluster common happiness indicator scores and look to see what scores assigned to the features and look for commonalities. For clusters with a similar happiness score, what range of scores are associated closely with each of the features?
For my second question, what features most influence a healthy life expectancy, I would perform a similar analysis as mentioned above. I would group (cluster) similar healthy life expectancy scores and look at the scores assigned to the features GDP per capita, social support, and generosity.
From a review of the scatter plots alone, it was difficult to tell which factors had the strongest impact on happiness and healthy life expectancy. Regression analysis assisted in assessing the best combination of features for happiness as well as a healthy life expectancy. All features included in the original study provided the best indicator of happiness
Essay Writing Service Features
Our Experience
No matter how complex your assignment is, we can find the right professional for your specific task. Contact Essay is an essay writing company that hires only the smartest minds to help you with your projects. Our expertise allows us to provide students with high-quality academic writing, editing & proofreading services.Free Features
Free revision policy
$10Free bibliography & reference
$8Free title page
$8Free formatting
$8How Our Essay Writing Service Works
First, you will need to complete an order form. It's not difficult but, in case there is anything you find not to be clear, you may always call us so that we can guide you through it. On the order form, you will need to include some basic information concerning your order: subject, topic, number of pages, etc. We also encourage our clients to upload any relevant information or sources that will help.
Complete the order formOnce we have all the information and instructions that we need, we select the most suitable writer for your assignment. While everything seems to be clear, the writer, who has complete knowledge of the subject, may need clarification from you. It is at that point that you would receive a call or email from us.
Writer’s assignmentAs soon as the writer has finished, it will be delivered both to the website and to your email address so that you will not miss it. If your deadline is close at hand, we will place a call to you to make sure that you receive the paper on time.
Completing the order and download