Skip to content
#

data-exploration-and-preprocessing

Here are 25 public repositories matching this topic...

vector-io

Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data from any vector databases or repository.

  • Updated Feb 10, 2025
  • Jupyter Notebook

Successfully developed a machine learning model which can accurately predict whether a firm will become bankrupt or not, depending on various features such as net value growth rate, borrowing dependency, cash/total assets, etc.

  • Updated Oct 22, 2023
  • Jupyter Notebook

The Employee Attrition Control project uses data analysis and predictive modeling to understand and address employee turnover. It provides insights and recommendations to reduce attrition and improve employee satisfaction and retention.

  • Updated Jun 16, 2023
  • Jupyter Notebook

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

  • Updated Oct 27, 2023
  • Jupyter Notebook

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

  • Updated Oct 15, 2023
  • Jupyter Notebook

Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.

  • Updated May 6, 2024
  • Jupyter Notebook

Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.

  • Updated Jan 31, 2025
  • Jupyter Notebook

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

  • Updated Oct 26, 2023
  • Jupyter Notebook

This project consists on exploratory data analysis and the application of supervised learning models for classification using a Hepatocellular Carcinoma dataset. Second Semester of the First Year of the Bachelor's Degree in Bioinformatics at FCUP.

  • Updated May 21, 2024
  • Jupyter Notebook

Successfully established a machine learning model using PySpark which can accurately classify whether a bank customer will churn or not up to an accuracy of more than 86% on the test set.

  • Updated Aug 4, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-exploration-and-preprocessing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-exploration-and-preprocessing topic, visit your repo's landing page and select "manage topics."

Learn more