New York Times Best Seller Analysis
Project Overview
This project provides data collection and analysis of the New York Times Best Seller lists, tracking trends and patterns in bestselling fiction books over time. The data contains two lists:
- Hardcover Fiction (1931-present)
- Combined Print & E-Book Fiction (2011-present)
Key Research Questions
- How has the NYT bestseller list evolved over time?
- Which authors and titles have dominated the lists?
- What patterns can we observe in bestseller performance?
- How do hardcover bestsellers compare to e-book inclusive listings?
Data Collection
Get the Data
Data Sources
This project combines data from multiple sources:
- Web Scraping: Current bestseller data collected directly from the New York Times website
- Historical Archives: Hardcover fiction data (1931-2010) from the Post 45 Collective
Automated Updates
The dataset is automatically refreshed weekly through GitHub Actions workflows, ensuring the analysis remains current with the latest bestseller information.
Repository
The complete code and data for this project are available on GitHub.