Exploring the relationship of drivers and mitigation measures with greenhouse gas emissions for European Union countries through exploratory data analysis and advanced statistical techniques namely Ordinary Least Square (OLS) Regression and Panel Data Regresion
Global warming is expected to result in a rise of the average global temperature between 1.1 to 6.4 degree celsius over the century, if there are no interventions taken to reduce emissions of greenhouse gases. Greenhouse gases are contributed by multiple drivers which include economic activities such as electricity production, transportation and waste generation. Carbon dioxide is the primary greenhouse gas emitted through human activities.
In December 2020, the European Union (EU) leaders committed to an ambitious goal of reducing greenhouse gases by 55% by 2030 to tackle climate change. The EU is currently the world’s third biggest emitter of greenhouse gases. Measures, such as emission taxes and use of renewable energy sources, were introduced to mitigate the greenhouse gas emissions, especially carbon dioxide emissions.
A large amount of climate change related statistics are available on Eurostat, with data across multiple domains such as environmental, social and economic statistics. Monitoring of the reduction of greenhouse gas emissions can be performed and is essential for tracking EU’s progress towards achieving its 2030 goal. Analysis of the impacts of the drivers and mitigation using the available statistics also allowed for insights on the determinants of the greenhouse gas emissions for the EU countries.
The project aims to deliver an R-Shiny app that provides interactive user interface design to:
Statistics from Eurostat covering a wide range of domains are used in this R Shiny Application. For our project, we will focus on datasets under greenhouse gas emissions, drivers and mitigation for all EU countries.
A generalized development timeframe for this project is shown below:
The proposed layout of the R Shiny app consists of 3 main tabs in the tab panel
Under the EDA tab, there will be another tab panel comprising of 4 tabs shown below:
Data | Proposed Visualisation |
---|---|
Greenhouse Gases Emission | ![]() |
Green Energy Production | ![]() |
Correlation Plot of Drivers and Mitigation | ![]() |
Scatter Plot for Drivers and Mitigation | ![]() |
The following shows the proposed first module under OLS regression where users can explore different methods to determine the set of independent variables for their model. Proposed type of visualization and rationales are as follow:
Thereafter, users will input their preferred dependent and independent variables to generate the regression estimations. At the same time, a series of validation/assumption checks will be in place for the users to determine if the chosen model is appropriate.
The Panel Data Regression module allows users to explore different time periods, countries and set of independent variables to run both the fixed effect and random effect model. The following shows the proposed layout for the module, where the selection of the relevant specifications is on the left panel and the visualisation along with the statistical output will be displayed on the right area.
Evaluation of Model Fit Assessment using the proposed list of tests below will also be shown.
Additional validation tests will also be performed to assess if there is any violation of assumptions. The list of validation tests for the panel models include: