RESEARCH: Sample size—When less is more

R&A Studies

RESEARCH: Sample size—When less is more

RESEARCH: Sample size—When less is more

R&A Studies

RESEARCH: Sample size—When less is more

Arik Florimonte

Jan 11 2018 12:06am

RESEARCH: Sample size—When less is more

Arik Florimonte

Jan 11 2018

Introduction

In baseball forecasting, it is widely understood that more data is better when trying to model future performance. Today we examine that assumption for pitchers, and find that occasionally a smaller data set is actually better. We will also explore at what point the recent data becomes more significant than the historical data.

Methodology

We will use pitching data from 2010-2017, since 2010 is the season that Baseball Info Solutions began using an algorithm to classify quality-of-contact. For annual data, we’ll use pitchers with = 120 IP. For monthly data, we use only data from pitchers with = 25 IP in that month.

Throughout the article we will use R² as a measure of correlation between data sets. The R² value describes...

Almost!

You’re just a few clicks away from accessing this feature and hundreds more throughout the year that have a singular goal in mind: Winning your league. Subscribe to BaseballHQ.com here!

Already a subscriber? Sign in here

More From R&A Studies

R&A Studies

Draft-level analytics for 2025

Draft-level analytics for 2025

Outmaneuver your competitors by replacing watered-down ADP with detailed draft-level analytics for 12-team and 15-team mixed leagues.

Ed DeCaria

Mar 17 2025 8:10am

R&A Studies

Draft-level analytics for 2025

Draft-level analytics for 2025

Ed DeCaria

Mar 17 2025 8:10am

R&A Studies

Draft-level analytics for 2024

Draft-level analytics for 2024

Outmaneuver your competitors by replacing watered-down ADP with detailed draft-level analytics for 12-team and 15-team mixed leagues

Ed DeCaria

Mar 22 2024 12:55pm

R&A Studies

Draft-level analytics for 2024

Draft-level analytics for 2024

Ed DeCaria

Mar 22 2024 12:55pm

R&A Studies

Revisiting Expected Stolen Bases (xSB)

Revisiting Expected Stolen Bases (xSB)

Updating xSB in light of recent MLB rule changes.

Ed DeCaria

Feb 9 2024 3:02am

R&A Studies

Revisiting Expected Stolen Bases (xSB)

Revisiting Expected Stolen Bases (xSB)

Ed DeCaria

Feb 9 2024 3:02am

R&A Studies

Checking in on Pure Quality Starts

Checking in on Pure Quality Starts

The pitching landscape has shifted yet again, and our Pure Quality Start metric undergoes a minor shift to level-set the results.

Jon Enriquez

Dec 20 2023 10:10am

R&A Studies

Checking in on Pure Quality Starts

Checking in on Pure Quality Starts

Jon Enriquez

Dec 20 2023 10:10am

R&A Studies

RESEARCH: Draft-level analytics for 2023

RESEARCH: Draft-level analytics for 2023

2023 draft-level analytics from dozens of individual NFBC drafts to complete your last-mile preparation

Ed DeCaria

Mar 26 2023 1:00pm

R&A Studies

RESEARCH: Draft-level analytics for 2023

RESEARCH: Draft-level analytics for 2023

Ed DeCaria

Mar 26 2023 1:00pm

Tools

PlayerLink Search

2025 Projections

CSV Download Center

Custom Draft Guide

Historical Stats

LeagueSync (new MACK)

SP Matchup Scores

Ad Unit