Difference between revisions of "Data Preparation Procedures"

From Wikibase.slis.ua.edu

Revision as of 15:15, 21 December 2018

This page provides information on preparing data for use with QuickStatements.

Procedures vary based on the year that a game occurred.

IMPORTANT: Before creating an Item page, be sure to search to confirm that it doesn't already exist!


PRELIMINARY STEPS

  1. Make sure a season page exists for each team participating in the game: Indexing a Football Team Season
  2. Create an item page for the game: Indexing a Football Game

PROCEDURES

Spreadsheet preparation procedures: download or acquire the spreadsheets, then prepare them according to the time period in which each game occurred. Procedures include cleaning the data and uploading the cleaned data to our Wikibase instance using our QuickStatements tool.

Category 1 Games (with available JSON-encoded play-by-play data sources - 2001 to present)

Football games in this category occurred from 2001 to the present and have JSON-encoded play-by-play data sources. The category is further subdivided by whether wall clock data is available for each play (games from 2014 to the present).

  1. Games 2014 to present (with wall clock data):
    1. Preparing Spreadsheets for Games 2014 to Present
    2. Preparing Drive Creation Spreadsheets for Games 2014 to Present
    3. Preparing Drive Data Spreadsheets for Games 2014 to Present
    4. Preparing Play Creation Spreadsheets for Games 2014 to Present
    5. Preparing Play-by-play Data Spreadsheets for Games 2014 to Present
    6. Further preparation procedures to come later, including for player participation data
  2. Games 2001 to 2013 (no wall clock data):
    1. Preparing Spreadsheets for Games 2001 to 2013
    2. Preparing Drive Creation Spreadsheets for Games 2001 to 2013
    3. Preparing Drive Data Spreadsheets for Games 2001 to 2013
    4. Preparing Play Creation Spreadsheets for Games 2001 to 2013
    5. Preparing Play-by-play Data Spreadsheets for Games 2001 to 2013
    6. Further preparation procedures to come later, including for player participation data

Category 2 Games (with play-by-play data source requiring transcription prior to 2001)

Football games in this category occurred prior to 2001 and have paper-based play-by-play data sources that require transcribing to spreadsheets.

  1. Preparing Spreadsheets for Transcribed Games
  2. Preparing Drive Creation Spreadsheets for Transcribed Games
  3. Preparing Drive Data Spreadsheets for Transcribed Games
  4. Preparing Play Creation Spreadsheets for Transcribed Games
  5. Preparing Play-by-play Data Spreadsheets for Transcribed Games
  6. Further preparation procedures to come later, including for player participation data

Category 3 Games (with no play-by-play data sources)

Football games in this category occurred prior to 2001 and do not have any play-by-play data sources other than newspaper game accounts that require transcribing to spreadsheets.

  1. Preparing Spreadsheets for Historical Games
  2. Preparing Drive Creation Spreadsheets for Historical Games
  3. Preparing Drive Data Spreadsheets for Historical Games
  4. Preparing Play Creation Spreadsheets for Historical Games
  5. Preparing Play-by-play Data Spreadsheets for Historical Games
  6. Further preparation procedures to come later, including for player participation data
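
The category boundaries above can be summarized as a small decision rule. The following is only a sketch in Python: the function name and the `has_paper_play_by_play` flag are hypothetical and are not part of any tool used on this wiki. It assumes the year ranges stated on this page.

```python
def game_category(year: int, has_paper_play_by_play: bool = False) -> str:
    """Return the preparation category for a game (hypothetical helper).

    Year ranges follow this page: JSON play-by-play exists from 2001 on,
    and wall clock data exists from 2014 on.
    """
    if year >= 2014:
        return "Category 1 (2014 to present, with wall clock data)"
    if year >= 2001:
        return "Category 1 (2001 to 2013, no wall clock data)"
    # Games before 2001 depend on whether a paper play-by-play source survives.
    if has_paper_play_by_play:
        return "Category 2 (transcribed games)"
    return "Category 3 (historical games)"


print(game_category(2016))
print(game_category(1992, has_paper_play_by_play=True))
```

For a pre-2001 game, the flag records whether a paper play-by-play source is available for transcription; without one, only newspaper game accounts remain and the game falls into Category 3.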

Data prep required to create and populate each drive's item page using QuickStatements is a two-step process:

    1. Create a spreadsheet that will derive Q numbers for each drive and incorporate "instance of" statements: Drive Creation Procedure
    2. Create a spreadsheet to populate data on each drive's (now existing) item page: Drive Data Preparation
  1. Play-by-play Data Preparation and Upload Procedures
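
As a rough illustration of the two-step drive process, the sketch below builds tab-separated commands in the QuickStatements version 1 text format (`CREATE`, then `LAST` lines for the newly created item). The function names are hypothetical, and the property and item IDs (`P1`, `P2`, `Q1`, `Q100`) are placeholders only; the real IDs for "instance of" and the drive class on this Wikibase instance must be looked up before use.

```python
def drive_creation_commands(drive_labels, instance_of_property="P1", drive_class_item="Q1"):
    """Step 1: one CREATE per drive, plus a label and an "instance of" statement.

    P1 and Q1 are placeholder IDs, not the real ones for this Wikibase.
    """
    lines = []
    for label in drive_labels:
        lines.append("CREATE")
        # "Len" sets the English label of the item just created.
        lines.append(f'LAST\tLen\t"{label}"')
        lines.append(f"LAST\t{instance_of_property}\t{drive_class_item}")
    return "\n".join(lines)


def drive_data_commands(drive_qid, statements):
    """Step 2: add statements to a drive item that already has a Q number."""
    return "\n".join(f"{drive_qid}\t{prop}\t{value}" for prop, value in statements)


print(drive_creation_commands(["Drive 1", "Drive 2"]))
print(drive_data_commands("Q100", [("P2", "7")]))
```

Step 2 can only run after step 1 has been processed, because the Q numbers used in the second batch do not exist until the items are created.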