data-cl

Here are 12 public repositories matching this topic...

Prajwal18py / SMART-CSV-HEALTH-CHECKER

Enterprise-grade CSV data quality analyzer powered by Machine Learning. Automatic anomaly detection, statistical profiling, PII scanning, and actionable insights. Secure user authentication, custom data pipelines, and interactive dashboards. Production-ready SaaS application.

python machine-learning web-app plotly pandas data-visualization data-analysis su data-quality anamoly-detection scik streamlit data-cl csv-analysis

Updated Mar 7, 2026
Python

rakeshkapilavayi / DataPulse-Automated-EDA

Star

This project automates exploratory data analysis (EDA) with DataPulse, enabling users to upload, clean, and visualize datasets effortlessly. It integrates machine learning models like Logistic Regression and XGBoost for insightful analysis via an intuitive Streamlit interface.

data-science data machine-learning exploratory-data-analysis data-cl

Updated Dec 13, 2025
Python

Abhilashayagyaseni / KPMG-Data-Analysis-Project

Star

Generate valuable insights from customer and transactions data.

dashboard data-visualization data-analysis tableau data-exploration interpretation email-draft model-development power-point data-cl data-quality-assessment

Updated Jul 1, 2023

tusharborkar18 / sql-data-warehouse-project

Star

A comprehensive end-to-end data warehouse project using MYSQL, covering ETL pipelines, data modeling, and analytics/reporting.

mysql etl data-warehouse data-analytics data-analysis data-integration data-normalization sql-queries data-cl data-standardization medallion-architecture

Updated Dec 29, 2025

balinaanna / nyc-traffic-accidents

Star

Real-world open data analysis using Excel pivots, visuals, and insights.

excel exploratory-data-analysis traffic-analysis eda pivot-tables data-visualization open-data data-analytics spatial-analysis temporal-analysis real-world-data portfolio-project data-cl analytics-project

Updated Dec 8, 2025
Jupyter Notebook

gauravkamble-insights / Nashville-Housing-Data-Cleaning

Star

A data cleaning project using Microsoft SQL Server to standardize 56,000+ rows of Nashville Housing data. It demonstrates advanced T-SQL techniques (including self-joins, string parsing, and CTEs) to transform messy, unformatted records into a structured dataset ready for analysis.

sql data-transformation data-analysis joins ssms portfolio-project data-cl nashville-housing-data data-standardization

Updated May 11, 2026
SQL

babrai / marketing-campaign-analysis

Star

Анализ эффективности рекламных кампаний и проверка корректности атрибуции пользователей. Проект включает работу с маркетинговыми данными: установки пользователей, рекламные расходы, доходы и каналы привлечения.

sql jupyter-notebook pandas data-analysis marketing-analytics roas data-cl attribution-modeling marketing-metrics

Updated Sep 26, 2025
Jupyter Notebook

EmperorYeqing / Student-Performance-Analysis

Star

This project focuses on analyzing student academic performance data to identify factors that influence exam scores and overall achievement. Using Python and Pandas, the goal is to clean, explore, and analyze the dataset to answer important questions about student performance, study habits, attendance, and other contributing factors.

python education beginner-project exploratory-data-analysis pandas data-analytics data-analysis matplotlib portfolio-project correlation-analysis staistics data-cl student-performance

Updated Jun 15, 2026
Python

poshandew / Play-Store-Analysis

Star

In this our project we aimed to gather and analyze detailed information on apps in the Google Play Store in order to provide insights on app features and the current state of the Android app market.

python machine-learning sentiment-analysis exploratory-data-analysis pandas data-visualization seaborn merging-data nump matp data-cl

Updated Apr 16, 2025
Jupyter Notebook

alexspecter / lead_salvage_pipeline

Star

Deterministic data-cleaning pipeline for business leads, with a local LLM fallback for messy records. Memory-safe chunking, CSV/DOCX support, and CSV-injection sanitization.

python etl data-pipeline data-cl llm

Updated Jun 5, 2026
Python

chanronnie / BikeSalesDashboard_Excel

Star

dashboard excel data-analysis data-cl

Updated Oct 26, 2023

julianpaulussen / Matelda-Demo

Star

Matelda, an interactive system for multi-table error detection that combines automated error detection with human-in-the-loop refinement.

demonstration vldb error-detection streamlit data-cl vldb-conference streamlit-application vldb2025

Updated Sep 18, 2025
Python

Improve this page

Add a description, image, and links to the data-cl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-cl topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-cl

Here are 12 public repositories matching this topic...

Prajwal18py / SMART-CSV-HEALTH-CHECKER

rakeshkapilavayi / DataPulse-Automated-EDA

Abhilashayagyaseni / KPMG-Data-Analysis-Project

tusharborkar18 / sql-data-warehouse-project

balinaanna / nyc-traffic-accidents

gauravkamble-insights / Nashville-Housing-Data-Cleaning

babrai / marketing-campaign-analysis

EmperorYeqing / Student-Performance-Analysis

poshandew / Play-Store-Analysis

alexspecter / lead_salvage_pipeline

chanronnie / BikeSalesDashboard_Excel

julianpaulussen / Matelda-Demo

Improve this page

Add this topic to your repo