Lahman-package {Lahman}R Documentation

Sean Lahman's Baseball Database

Description

This database contains pitching, hitting, and fielding statistics for Major League Baseball from 1871 through 2012. It includes data from the two current leagues (American and National), the four other "major" leagues (American Association, Union Association, Players League, and Federal League), and the National Association of 1871-1875.

This database was created by Sean Lahman, who pioneered the effort to make baseball statistics freely available to the general public. What started as a one man effort in 1994 has grown tremendously, and now a team of researchers have collected their efforts to make this the largest and most accurate source for baseball statistics available anywhere.

This database, in the form of an R package offers a variety of interesting challenges and opportunities for data processing and visualization in R.

Details

Package: Lahman
Type: Package
Version: 2.0-1
Date: 2013-04-18
License: GPL version 2 or newer
LazyLoad: yes
LazyData: yes

The main form of this database is a relational database in Microsoft Access format. The design follows these general principles. Each player is assigned a unique code (playerID). All of the information in different tables relating to that player is tagged with his playerID. The playerIDs are linked to names and birthdates in the Master table. Similar links exist among other tables via analogous *ID variables.

The database is comprised of the following main tables:

Master

Player names, dates of birth, death and other biographical info

Batting

batting statistics

Pitching

pitching statistics

Fielding

fielding statistics

A collection of other tables is also provided:

Teams:

Teams yearly stats and standings
TeamsHalf split season data for teams
TeamsFranchises franchise information

Post-season play:

BattingPost post-season batting statistics
PitchingPost post-season pitching statistics
FieldingPost post-season fielding data
SeriesPost post-season series information

Awards:

AwardsManagers awards won by managers
AwardsPlayers awards won by players
AwardsShareManagers award voting for manager awards
AwardsSharePlayers award voting for player awards

Hall of Fame: links to Master via hofID

HallOfFame Hall of Fame voting data

Others tables:

AllstarFull - All-Star games appearances; Managers - managerial statistics; FieldingOF - outfield position data; ManagersHalf - split season data for managers; Salaries - player salary data; Appearances - data on player appearances; Schools - Information on schools players attended; SchoolsPlayers - Information on schools players attended, by player;

Variable label tables are provided for some of the tables:

battingLabels, pitchingLabels, fieldingLabels

Author(s)

Michael Friendly and Dennis Murphy

Maintainer: Michael Friendly <friendly@yorku.ca>

Source

Lahman, S. (2010) Lahman's Baseball Database, 1871-2012, Main page, http://baseball1.com/statistics/

Lahman, S. (2010) Lahman's Baseball Database, 1871-2012, v. 2012, Comma-delimited version, http://seanlahman.com/files/database/lahman2012-csv.zip

Lahman, S. (2010) Lahman's Baseball Database, 1871-2012, MS Access version, http://seanlahman.com/files/database/lahman2012-ms.zip


[Package Lahman version 2.0-1 Index]