Project Proposal

Exploring the relationship of drivers and mitigation measures with greenhouse gas emissions for European Union countries through exploratory data analysis and advanced statistical techniques namely Ordinary Least Square (OLS) Regression and Panel Data Regresion

Published

Feb. 24, 2021

DOI

Background

Global warming is expected to result in a rise of the average global temperature between 1.1 to 6.4 degree celsius over the century, if there are no interventions taken to reduce emissions of greenhouse gases. Greenhouse gases are contributed by multiple drivers which include economic activities such as electricity production, transportation and waste generation. Carbon dioxide is the primary greenhouse gas emitted through human activities.

In December 2020, the European Union (EU) leaders committed to an ambitious goal of reducing greenhouse gases by 55% by 2030 to tackle climate change. The EU is currently the world’s third biggest emitter of greenhouse gases. Measures, such as emission taxes and use of renewable energy sources, were introduced to mitigate the greenhouse gas emissions, especially carbon dioxide emissions.

Motivation

A large amount of climate change related statistics are available on Eurostat, with data across multiple domains such as environmental, social and economic statistics. Monitoring of the reduction of greenhouse gas emissions can be performed and is essential for tracking EU’s progress towards achieving its 2030 goal. Analysis of the impacts of the drivers and mitigation using the available statistics also allowed for insights on the determinants of the greenhouse gas emissions for the EU countries.

Project Objectives

The project aims to deliver an R-Shiny app that provides interactive user interface design to:

Broad Data Description

Statistics from Eurostat covering a wide range of domains are used in this R Shiny Application. For our project, we will focus on datasets under greenhouse gas emissions, drivers and mitigation for all EU countries.

Proposed Scope and Methodology

  1. Analysis of Eurostat greenhouse emission dataset with background research
  2. Exploratory Data Analysis (EDA) methods in R
  3. Analysis of Ordinary Least Square regression in R
  4. Analysis of Panel data regression in R
  5. Development of validating checks and model selection in R for both Ordinary Least Square and Panel regression
  6. R-Shiny app development for user interactivity

A generalized development timeframe for this project is shown below:

Storyboard & Visualization Features

(A) Exploratory Data Analysis (EDA) methods in R

The proposed layout of the R Shiny app consists of 3 main tabs in the tab panel

Under the EDA tab, there will be another tab panel comprising of 4 tabs shown below:

Data Proposed Visualisation
Greenhouse Gases Emission
Green Energy Production
Correlation Plot of Drivers and Mitigation
Scatter Plot for Drivers and Mitigation

(B) Analysis of Cross-Sectional Data through Ordinary Least Square Regression in R

The following shows the proposed first module under OLS regression where users can explore different methods to determine the set of independent variables for their model. Proposed type of visualization and rationales are as follow:

Thereafter, users will input their preferred dependent and independent variables to generate the regression estimations. At the same time, a series of validation/assumption checks will be in place for the users to determine if the chosen model is appropriate.

(C) Analysis of Longitudinal Data through Panel Data Regression in R

The Panel Data Regression module allows users to explore different time periods, countries and set of independent variables to run both the fixed effect and random effect model. The following shows the proposed layout for the module, where the selection of the relevant specifications is on the left panel and the visualisation along with the statistical output will be displayed on the right area.

Evaluation of Model Fit Assessment using the proposed list of tests below will also be shown.

Additional validation tests will also be performed to assess if there is any violation of assumptions. The list of validation tests for the panel models include:

Proposed R Packages

Data Preparation

Exploratory data analysis

Ordinary Least Square regression

Panel data regression

Building Shiny App

Team Members

References

Footnotes