Downloading and Initial Prepping of Spreadsheets for Category 1 Games 2014 to Present
Jump to navigation
Jump to search
Download JSON file and complete following steps to prepare FOUR spreadsheet (.csv) files:
- Go to online play-by-play data source and download a game's JSON file as follows:
- Click on desired year
- Click on JSON link
- Click on desired week
- Click on Full link
- Click on desired game
- Click on download link button in upper right hand corner
- The desired file will now be in your download folder
- To convert JSON to CSV, use this website: https://json-csv.com/ as follows:
- Click on Upload JSON file
- Navigate to your download folder and double click on JSON file downloaded in steps above
- Right click on downloaded file (lower left) and open
- Save file using year and the competing team names and annotate file name with "FULL_DATA_SPREADSHEET" as follows: 2017-ALA_AUB-FULL_DATA_SPREADSHEET
- Create "clean" version of the spreadsheet (that is, without hidden Excel codes) as follows:
- Highlight and copy all data (right click -> copy)
- Go to blank spreadsheet and paste special as "paste values" with no additional formatting
- Save file as csv: 2017-ALA_AUB-FULL_DATA_SPREADSHEET.csv
- Enter property numbers (PXX) at top of each relevant column:
- Best accomplished by copying and pasting row from known spreadsheet source
- PLEASE NOTE: For games from 2014 to 2016, there are two missing columns; THEREFORE, you must delete two cells from inserted row and shift cells to the left
- Add data to cells in these columns as follows:
- Column BA (P35): Overwrite existing data with drive sequence data (Do not include KOs as a drive)
- Column BB (P19): Overwrite existing data with play sequence data starting with Opening Kickoff as play #1
- Column BC (P36): Overwrite existing data with play sequence data (play # relative to each drive)
- End of periods, end of halfs, end of games, and timeouts are not counted as plays in either game or drive play sequences
- Transform data in these columns as follows:
- First, copy the entire column BT (P37) and paste it over the data in column BU (P46) (be sure to change the P37 heading of column BU back to P46)
- Column BT (P37): Strip out date data using the following example as an exemplar:
- For example, if the data in a column's cell is "2018-01-09T01:19:25Z"
- Then, do a "find and replace" in that column by pasting the string "2018-01-09T" into the FIND box and keep the REPLACE box empty
- Then, do another "find and replace" by typing "Z" into the FIND box and keep the REPLACE box empty
- Be sure to "replace all" values in each cell of the column for both find and replace
- This should leave you with just the time values in each column's cells ("01:19:25" would remain in the first cell of our example)
- Column BU (P46): Transform the data in this column from ISO to Unix time as follows:
- We will be using the following online utility to transform the data in this column: Epoch Batch Conversion Tool
- IMPORTANT: If there are any blank cells in column BU (P46), then this procedures must be done in parts without including blank cells in the batch conversion process
- In column BU (P46), highlight and copy cell with data stopping whenever coming upon a blank cell without data
- Paste the data in the box provided online at https://www.epochconverter.com/batch#human2epoch
- Change the output option from "Screen" to the second option ("(',' comma separated, use for Excel in the US)
- Click the "Convert" button at bottom
- Copy the transformed date data and paste it back over the copied cells into our spreadsheet in column BU (P46)
- Split column CO (P24/P25) into two columns by copying its contents and pasting it over the data in column CP
- Change the label of column CO (P24/P25) to column CO (P24)
- Change the label of column CP (P24/P25) to column CP (P25)
- Add data to column CP (P25) reflecting the other team from the data as shown in column CO (P24)
- Delete all columns right to left that are not labeled P(XX) leaving the following:
- Column AX (P8)
- Column AY (P10)
- Column AZ (P6)
- Column BA (P35)
- Column BB (P19)
- Column BC (P36)
- Column BD (P13)
- Column BF (P14)
- Column BI (P15)
- Column BL (P34)
- Column BN (P16)
- Column BR (P38)
- Column BS (P11)
- Column BT (P37)
- Column BU (P46)
- Column CE (P20)
- Column CF (P8)
- Column CI (P34)
- Column CJ (P13)
- Column AX (P8)
- Column AX (P8)
- Column AX (P8)
- Save FOUR copies of the spreadsheet as follows:
- Name the first copy to include "DRIVE_CREATION_SPREADSHEET" (e.g., 2017-ALA_AUB-DRIVE_CREATION_SPREADSHEET.csv)
- Name the second copy to include "DRIVE_DATA_SPREADSHEET" (e.g., 2017-ALA_AUB-DRIVE_DATA_SPREADSHEET.csv)
- Name the third copy to include "PLAY-BY-PLAY_CREATION_SPREADSHEET" (e.g., 2017-ALA_AUB-PLAY-BY-PLAY_CREATION_SPREADSHEET.csv)
- Name the fourth copy to include "PLAY-BY-PLAY_DATA_SPREADSHEET" (e.g., 2017-ALA_AUB-PLAY-BY-PLAY_DATA_SPREADSHEET.csv)
Navigation:
- Next procedural step: Preparing Drive Creation Spreadsheets for Category 1 Games 2014 to Present
- Return to Data Preparation Procedures