Projects

Datasets

Sentiment Analysis Dataset for OpenAI Finetuning

OpenAI finetuning in action — labeled examples for sentiment classification.

285 kB·2025
View on Kaggle

SWE Bench Verified

Evaluates an AI model's ability to solve real-world software issues.

1 MB·2024
View on Kaggle

Acquired Podcast Transcripts and RAG Evaluation

LLM RAG evaluation set with human-generated Q&A over podcast transcripts.

14 MB·2024
View on Kaggle

NBA Play-by-Play 2019-2020 Season

Comprehensive play-by-play data covering every play of the season.

5 MB·2024
View on Kaggle

Unsplash Photos

1k and 5k resized and compressed photo sets for image analytics.

9 GB·2023
View on Kaggle

Quasi-experimental Methods

A practical guide to propensity score matching and difference-in-differences (DID) in Python and R.

25 kB·2022
View on Kaggle

World Cities

Information about ~41,000 places around the world.

1 MB·2022
View on Kaggle

Superstore Sales Data

Classic sales transactions sample for BI dashboards and visualization practice.

560 kB·2022
View on Kaggle

Crypto Coven

When data science meets NFTs: Crypto Coven traits and metadata.

3 GB·2022
View on Kaggle

Wine Dataset for Clustering

Cluster wines based on their chemical constituents.

4 kB·2020
View on Kaggle

California Housing Data (1990)

California housing price prediction — a classic teaching dataset.

410 kB·2018
View on Kaggle

© 2026 Harry Wang