London Atmospheric Emissions Inventory Analysis

This repository contains a Jupyter notebook that tackles the problem of predicting atmospheric emissions in London given the emissions data for 2008, 2010, 2013, and 2020 specifically. The dataset was obtained from this link: https://data.london.gov.uk/dataset/london-atmospheric-emissions-inventory-2013. The dataset focuses on major road emissions only given the ease by which it was to record this information compared to rural roads.

Notebook Overview

Key Features:

Import Libraries (EDA):

Key libraries imported are pandas, numpy, scikit-learn, matplotlib, and torch.

Load data and concatenate:

Individual files were loaded in using pandas
We concatenated the data together since we plan to train on all the historical data to generate predictions.

Format Columns:

Convert all object type columns to a numeric format by first encoding each label and then converting the values to their encoded value.

Split Data:

Split the dataset so that any data that took place prior to 2020 would be considered historical and will be used for training while the rest of the data will be used as the test set for model evaluation.

Train Models:

Trained a Linear Regression model
Trained a Decision Tree Regressor model
Trained a simple feed forward neural network

Evaluate model performance

Used MSE, RMSE, MAE, and R2 score to evaluate model performances.

Technologies Used:

Deep Learning for Neural Network architecture and training
Machine Learning regression algorithms
Data visualization for interpretability

Requirements

To run the notebook, install the following Python libraries:

pandas
matplotlib
scikit-learn
numpy
torch

You can install the required libraries with:

pip install pandas matplotlib scikit-learn numpy torch

How to Use

Clone the repository:

git clone https://github.com/Vimal-Raghubir London-Atmospheric-Emissions-Inventory-Analysis.git
Open the Jupyter notebook:

jupyter notebook london_atmospheric_emissions_inventory_analysis.ipynb
Follow the structured steps in the notebook to:

Import the Libraries.
Load data and concatenate.
Format Columns.
Split Data.
Train Models
Evaluate model performance

Future Enhancements

Potential improvements to the notebook could include:

Trying different regression algorithms like XGBoost.
Adding more data from previous years not covered in the existing dataset like 2018, 2019, etc.
Trying bucketing and using classification algorithms to see if it results in improved performance.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
london_atmospheric_emissions_inventory_analysis.ipynb		london_atmospheric_emissions_inventory_analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

London Atmospheric Emissions Inventory Analysis

Notebook Overview

Key Features:

Technologies Used:

Requirements

How to Use

Future Enhancements

About

Releases

Packages

Languages

Vimal-Raghubir/London-Atmospheric-Emissions-Inventory-Analysis

Folders and files

Latest commit

History

Repository files navigation

London Atmospheric Emissions Inventory Analysis

Notebook Overview

Key Features:

Technologies Used:

Requirements

How to Use

Future Enhancements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages