Data Preparation Procedures

From Wikibase.slis.ua.edu
Revision as of 15:27, 21 December 2018 by Admin (talk | contribs)
Jump to navigation Jump to search

This page provides information on data preparation for using QuickStatement.

Procedures vary based on the year that a game occurred.

IMPORTANT: Before creating an Item page, be to to search to be sure it doesn't already exist!


PRELIMINARY STEPS

  1. Make sure a season page exists for each team participating in the game: Indexing a Football Team Season
  2. Creating an item page for a game: Indexing a Football Game

PROCEDURES

Spreadsheet preparation procedures - downloading/acquiring and then preparing spreadsheets based on time period within which each game occurred. Procedures include data cleaning and uploading cleaned data to our Wikibase instance using our QuickStatements tool.

Category 1 Games (with available JSON-encoded play-by-play data sources - 2001 to present)

Football games in this category occurred 2001 to present and have JSON-encoded play-by-play data sources. This category further subdivides based on the availability of wall clock data for each play (games occurring 2014 to present).

  1. Games 2014 to present (JSON files with wall clock data):
    1. Downloading and Initial Prepping of Spreadsheets for Games 2014 to Present
    2. Preparing Drive Creation Spreadsheets for Games 2014 to Present
    3. Preparing Drive Data Spreadsheets for Games 2014 to Present
    4. Preparing Play Creation Spreadsheets for Games 2014 to Present
    5. Preparing Play-by-play Data Spreadsheets for Games 2014 to Present
    6. Further preparation procedures to come later, including for player participation data
  2. Games 2001 to 2013 (JSON files without wall clock data):
    1. Downloading and Initial Prepping of Spreadsheets for Games 2001 to 2013
    2. Preparing Drive Creation Spreadsheets for Games 2001 to 2013
    3. Preparing Drive Data Spreadsheets for Games 2001 to 2013
    4. Preparing Play Creation Spreadsheets for Games 2001 to 2013
    5. Preparing Play-by-play Data Spreadsheets for Games 2001 to 2013
    6. Further preparation procedures to come later, including for player participation data

Category 2 Games (with play-by-play data sources requiring transcribing)

Football games in this category occurred prior to 2001 and have paper-based play-by-play data sources that require transcribing to spreadsheets.

  1. Transcribing Steps and Initial Prepping of Spreadsheets for Transcribed Games
  2. Preparing Drive Creation Spreadsheets for Transcribed Games
  3. Preparing Drive Data Spreadsheets for Transcribed Games
  4. Preparing Play Creation Spreadsheets for Transcribed Games
  5. Preparing Play-by-play Data Spreadsheets for Transcribed Games
  6. Further preparation procedures to come later, including for player participation data

Category 3 Games (with play-by-play data requiring reconstruction from newspaper accounts)

Football games in this category occurred prior to 2001 and do not have any play-by-play data sources other than newspaper game accounts that require transcribing to spreadsheets.

  1. Preparing Spreadsheets for Historical Games
  2. Preparing Drive Creation Spreadsheets for Historical Games
  3. Preparing Drive Data Spreadsheets for Historical Games
  4. Preparing Play Creation Spreadsheets for Historical Games
  5. Preparing Play-by-play Data Spreadsheets for Historical Games
  6. Further preparation procedures to come later, including for player participation data

Data prep required for creating and populate each drive's item page using QuickStatements... two steps process:

    1. Create spreadsheet that will derive Q numbers for each drive and incorporating "instance of" statements: Drive Creation Procedure
    2. Create spreadsheet to populate data about each drive's (now) existing item page: Drive Data Preparation
  1. Play-by-play Data Preparation and Upload Procedures