Projects

Sentiment Analysis Dataset for OpenAI Finetuning
OpenAI finetuning in action — labeled examples for sentiment classification.
285 kB·2025
View on Kaggle
SWE Bench Verified
Evaluates an AI model's ability to solve real-world software issues.
1 MB·2024
View on Kaggle
Acquired Podcast Transcripts and RAG Evaluation
LLM RAG evaluation set with human-generated Q&A over podcast transcripts.
14 MB·2024
View on Kaggle
NBA Play-by-Play 2019-2020 Season
Comprehensive play-by-play data covering every move of the season.
5 MB·2024
View on Kaggle
Unsplash Photos
1k and 5k resized and compressed photo sets for image analytics.
9 GB·2023
View on Kaggle
Quasi-experimental Methods
A practical guide to propensity score matching and DID in Python and R.
25 kB·2022
View on Kaggle
World Cities
Information about ~41,000 places around the world.
1 MB·2022
View on Kaggle
Superstore Sales Data
Classic sales transactions sample for BI dashboards and visualization practice.
560 kB·2022
View on Kaggle
Crypto Coven
When data science meets NFT — Crypto Coven traits and metadata.
3 GB·2022
View on Kaggle
Wine Dataset for Clustering
Cluster wines based on their chemical constituents.
4 kB·2020
View on Kaggle
California Housing Data (1990)
California housing price prediction — a classic teaching dataset.
410 kB·2018
View on Kaggle