Understanding the Leading Causes of Death in the US

Code
import pandas as pd
import plotly.express as px
data = pd.read_csv('NCHS_-_Leading_Causes_of_Death__United_States.csv')
df_us = data[data.State == 'United States']
df = df_us.groupby('Cause Name').sum().reset_index()
df = df[df['Cause Name'] != 'All causes']


fig = px.pie(df, names='Cause Name', values='Deaths', title='Leading Causes of Death in the US from 1999 to 2017')
fig.update_traces(textposition='outside', textinfo='percent+label')
fig.update_layout(font_family='Times New Roman',  # Font for axis labels and tick labels
                  font_size=14)
fig.show()

The chart above illustrates the primary causes of death in the United States from 1999 to 2017, presenting a comprehensive view of the public health challenges during this period. The pie chart breaks down various causes of death, with each segment representing a particular cause and its corresponding percentage of the total deaths.

Key Findings:

  1. Heart Disease (33.7%): Heart disease ranks as the leading cause of death, accounting for a significant 33.7% of total deaths. This underscores the critical impact of cardiovascular health issues in the US. Factors contributing to heart disease include high blood pressure, obesity, diabetes, and lifestyle choices like diet and physical activity.

  2. Cancer (29.9%): Cancer closely follows heart disease, responsible for nearly 30% of deaths. Various types of cancer, such as lung, breast, and colorectal, have contributed significantly to this statistic. Cancer prevention and early detection remain a major focus in healthcare.

  3. Stroke (7.52%): Stroke, with a 7.5% share of deaths, is another major health concern. Strokes can be prevented by managing risk factors such as high blood pressure, diabetes, and lifestyle habits, though they continue to affect a considerable portion of the population.

  4. Chronic Lower Respiratory Disease (CLRD) (7.15%): Diseases like chronic bronchitis, emphysema, and chronic obstructive pulmonary disease (COPD) fall under CLRD and account for 7.15% of deaths. Smoking is a leading cause, highlighting the importance of anti-smoking campaigns and preventive healthcare.

  5. Unintentional Injuries (6.47%): This category includes accidental deaths from motor vehicle accidents, falls, and drug overdoses. The high number of deaths from unintentional injuries emphasizes the need for better safety measures, public awareness, and policies to reduce accidents and overdose rates.

  6. Alzheimer’s Disease (4.12%): Alzheimer’s disease, a neurodegenerative disorder, accounts for 4.12% of total deaths. With an aging population, the prevalence of Alzheimer’s is expected to rise, placing greater demand on healthcare services and support systems for those affected.

  7. Diabetes (3.86%): Diabetes contributes to 3.86% of deaths, highlighting the impact of diet, lifestyle, and genetic factors. Better management of blood sugar levels and lifestyle interventions can reduce complications from diabetes.

  8. Influenza and Pneumonia (3.02%): Infectious diseases such as influenza and pneumonia represent 3.02% of deaths. This statistic is a reminder of the importance of vaccinations and timely medical intervention, especially for vulnerable groups like the elderly and those with weakened immune systems.

  9. Kidney Disease (2.37%): Kidney disease accounts for 2.37% of deaths. Diabetes and high blood pressure are major contributors to kidney disease, making it crucial to address these risk factors to prevent kidney-related complications.

  10. Suicide (1.92%): Suicide, with a 1.92% share, reflects ongoing mental health challenges in the US. The relatively high rate indicates a need for more accessible mental health resources, support systems, and public awareness to address this critical issue.

We can also observe on the graph below, the evolution of the number of deaths over time:

Code
df_time = df_us.groupby(['Year', 'Cause Name']).sum().reset_index()
df_time = df_time[df_time['Cause Name'] != 'All causes']
fig = px.line(df_time, x='Year', y='Deaths', color='Cause Name', title='Deaths Over Time by Cause')
fig.update_traces(mode='lines+markers')
fig.update_layout(title='Deaths Over Time by Cause (1999-2017)',
                  xaxis_title='Year',
                  yaxis_title='Number of Deaths',
                  font_family='Times New Roman',
                  font_size=14,
                  xaxis=dict(tickmode='linear', tick0=1999, dtick=1, range=[1999, 2017],  tickangle=45),
                  yaxis=dict(range=[0, 800000], dtick=100000),
                  width=800,  # Adjust the width as needed
                  height=500)  # Adjust the height as needed
fig.show()

Conclusion

This chart highlights the primary health challenges the US faced between 1999 and 2017. It emphasizes the importance of preventive measures, public health policies, and healthcare innovations to address these leading causes of death. From heart disease and cancer to mental health and accidental injuries, each category represents an area where improvements in healthcare access, lifestyle choices, and public awareness can make a meaningful difference in reducing mortality rates.

Sources

Tejada-Vera, B. Leading Causes of Death: United States, 1999–2017. National Center for Health Statistics, 2019.