Difference between revisions of "2018-19 Academic Year Research Report"

From Wikibase.slis.ua.edu
Jump to navigation Jump to search
 
(18 intermediate revisions by the same user not shown)
Line 1: Line 1:
  
<h1><center>Data-driven Semantic Indexing of Football Images:<br>
+
<h1><center>Data-driven Semantic Indexing of Football Images from the Entire 2017 Alabama Crimson Tide Football Season:<br>
 
An Experiment in Linked Data Using a Local Wikibase Instance</center></h1><br>
 
An Experiment in Linked Data Using a Local Wikibase Instance</center></h1><br>
  
* [https://smaccall.people.ua.edu Steven L. MacCall, PhD]
+
<b>[https://wikibase.slis.ua.edu/ Linked Data Research Group]</b><br>
* Associate Professor
+
<b>[https://slis.ua.edu/ School of Library and Information Studies]</b><br>
* School of Library and Information Studies
+
<b>[https://cis.ua.edu/ College of Communication and Information Sciences]</b><br>
* University of Alabama
+
University of Alabama<br>
* [mailto:smaccall@slis.ua.edu smaccall@slis.ua.edu]
+
<br>
  
 
Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and [https://wikibase.slis.ua.edu/wiki/Plays_with_Example_UA_Images_May_2019 identification] process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software.
 
Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and [https://wikibase.slis.ua.edu/wiki/Plays_with_Example_UA_Images_May_2019 identification] process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software.
Line 13: Line 13:
 
Outline of research accomplishments resulting from this experiment:
 
Outline of research accomplishments resulting from this experiment:
  
# '''<u>Progress Report: RGC grant-funded research accomplishments thus far.</u>''' We were able to successfully transform [https://drive.google.com/drive/folders/0B13YvT1olZIfZHp1cGRIVkdyV2M publicly accessible] JSON-encoded statistical play-by-play datasets for all 14 games in the 2017 Alabama Crimson Tide football season into a [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team linked data application] using Wikibase. '''One can now navigate from play to play, drive to drive, and game to game within the [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team 2017 football season] using data drawn from existing statistical play-by-play datasets incorporated into the application by way of linked data methods.''' Highlights:
+
# '''<u>Progress Report: RGC grant-funded research accomplishments thus far.</u>''' We were able to successfully transform [https://api.collegefootballdata.com/api/docs/?url=/api-docs.json Web-accessible] JSON-encoded statistical play-by-play datasets for all 14 games in the 2017 Alabama Crimson Tide football season into a [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team linked data application] using Wikibase. '''One can now navigate from play to play, drive to drive, and game to game within the [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team 2017 football season] using data drawn from existing statistical play-by-play datasets incorporated into the application by way of linked data methods.''' Highlights:
## Example of a [https://drive.google.com/file/d/0B13YvT1olZIfX3dJZUVPSUNDMmM/view?usp=sharing JSON-encoded statistical play-by-play dataset] from 2017 Alabama versus Florida State football game.
 
 
## [https://wikibase.slis.ua.edu/w/index.php?title=Special:ListProperties/&limit=100&offset=0 Property list] for the ontology developed for this application.  
 
## [https://wikibase.slis.ua.edu/w/index.php?title=Special:ListProperties/&limit=100&offset=0 Property list] for the ontology developed for this application.  
 
## Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology):
 
## Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology):
Line 24: Line 23:
 
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_drive&action=edit a drive]
 
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_drive&action=edit a drive]
 
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_game&action=edit a game]
 
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_game&action=edit a game]
## Example SPARQL queries for retrieving entity instances ('''to run each query, click on the Blue Arrow icon in lower left portion of screen after clicking on links below'''):
+
## Example SPARQL queries for retrieving plays from the 2017 Alabama Crimson Tide football season meeting a variety of query criteria ('''PLEASE NOTE: To run each query, click on the Blue Arrow icon in lower left portion of screen after clicking on links below'''):
### [https://tinyurl.com/wfwec32 Rushing touchdowns for plays over 50 yards during the 2017 Alabama Crimson Tide football season]
+
### [https://tinyurl.com/vut7upm All rushing touchdowns that went for over 50 yards] during the 2017 Alabama Crimson Tide football season (limit to those for which there are [https://tinyurl.com/tmthgch video clips])
### [https://wikibase.slis.ua.edu/sparql/#%23%20Touchdown%20passes%20involving%20Jalen%20Hurts%20for%20plays%20over%2050%20yards%0ASELECT%20%3FPlay%20%3FTextual_Description_of_Play_Outcome%20%3FYardage_Resulting_from_Play_Outcome%20%3FTeam_on_DefenseLabel%20WHERE%20%7B%0A%20%20%3FPlay%20wdt%3AP17%20wd%3AQ8%3B%20%20%23Is%20a%20play%0A%20%20%20%20%20%20%20%20wdt%3AP16%20wd%3AQ35%3B%20%20%23Touchdown%20pass%20play%0A%20%20%20%20%20%20%20%20wdt%3AP60%20wd%3AQ1355%3B%20%23Jalen%20Hurts%20as%20a%20participating%20player%0A%20%20%20%20%20%20%20%20wdt%3AP38%20%3Fplays__statYardage.%20%23%20Yardage%20gain%0A%20%20%20%20%20%20%20%20%20%20FILTER%28%3Fplays__statYardage%20%3E%3D%2050%29.%20%20%23Yardage%20gained%20%3E%3D%2050%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP20%20%3FTextual_Description_of_Play_Outcome.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP38%20%3FYardage_Resulting_from_Play_Outcome.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP25%20%3FTeam_on_Defense.%20%7D%0A%7D%0ALIMIT%20100 Touchdown passes involving Jalen Hurts for plays over 50 yards]
+
### [https://tinyurl.com/ws9xknu All Jalen Hurts touchdown passes that went for over 25 yards] in games from the 2017 Alabama Crimson tide football season (Just those Jalen Hurts TD passes for which there are [https://tinyurl.com/vsnvejs video clips]).
### This pair of SPARQL queries demonstrates filtering capabilities for plays without video clips vs those plays that do:
+
### [https://tinyurl.com/yx5od34s All touchdown passes that went for over 10 yards] in the 2017 Alabama Crimson Tide football season (Just those plays for which there are [https://tinyurl.com/vxxdfzy video clips]).
#### [https://tinyurl.com/rp6rkv6 All touchdown pass plays of greater than 10 yards]
 
#### [https://wikibase.slis.ua.edu/sparql/#%23%20Touchdown%20passes%20for%20plays%20over%2010%20yards%20with%20a%20video%20clips%20that%20show%20site%20pages%20to%20clips%0APREFIX%20schema%3A%20%3Chttp%3A%2F%2Fschema.org%2F%3E%0ASELECT%20%3FPlay%20%3FTextual_Description_of_Play_Outcome%20%3FYardage_Resulting_from_Play_Outcome%20%3FTeam_on_DefenseLabel%20%3FsitePage%20WHERE%20%7B%0A%20%20%3FPlay%20wdt%3AP17%20wd%3AQ8%3B%20%20%23Is%20a%20play%0A%20%20%20%20%20%20%20%20wdt%3AP16%20wd%3AQ35%3B%20%20%23Touchdown%20pass%20play%0A%20%20%20%20%20%20%20%20wdt%3AP75%20wd%3AQ5699%3B%0A%20%20%20%20%20%20%20%20wdt%3AP38%20%3Fplays__statYardage.%20%23%20Yardage%20gain%0A%20%20%3FsitePage%20schema%3Aabout%20%3FPlay.%0A%20%20%20%20%20%20%20%20%20%20FILTER%28%3Fplays__statYardage%20%3E%3D%2010%29.%20%20%23Yardage%20gained%20%3E%3D%2010%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP20%20%3FTextual_Description_of_Play_Outcome.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP38%20%3FYardage_Resulting_from_Play_Outcome.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FPlay%20wdt%3AP25%20%3FTeam_on_Defense.%20%7D%0A%7D%0ALIMIT%20100 All touchdown pass plays that have available video clips]
 
 
# '''<u>Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets</u>'''. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays:
 
# '''<u>Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets</u>'''. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays:
 
## [https://wikibase.slis.ua.edu/wiki/Tua_Tagovailoa_pass_complete_to_DeVonta_Smith_for_27_yds_for_a_TD_(Andy_Pappanastos_KICK)_(9/23/17) Tua Tagovailoa pass complete to DeVonta Smith for 27 yds for a TD]
 
## [https://wikibase.slis.ua.edu/wiki/Tua_Tagovailoa_pass_complete_to_DeVonta_Smith_for_27_yds_for_a_TD_(Andy_Pappanastos_KICK)_(9/23/17) Tua Tagovailoa pass complete to DeVonta Smith for 27 yds for a TD]

Latest revision as of 14:15, 26 October 2022

Data-driven Semantic Indexing of Football Images from the Entire 2017 Alabama Crimson Tide Football Season:
An Experiment in Linked Data Using a Local Wikibase Instance


Linked Data Research Group
School of Library and Information Studies
College of Communication and Information Sciences
University of Alabama

Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and identification process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software.

Outline of research accomplishments resulting from this experiment:

  1. Progress Report: RGC grant-funded research accomplishments thus far. We were able to successfully transform Web-accessible JSON-encoded statistical play-by-play datasets for all 14 games in the 2017 Alabama Crimson Tide football season into a linked data application using Wikibase. One can now navigate from play to play, drive to drive, and game to game within the 2017 football season using data drawn from existing statistical play-by-play datasets incorporated into the application by way of linked data methods. Highlights:
    1. Property list for the ontology developed for this application.
    2. Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology):
      1. Mediawiki site page for a typical play (here is this play's corresponding Wikibase item page)
      2. Mediawiki site page for a typical drive (here is this drive's corresponding Wikibase item page)
      3. Mediawiki site page for a typical game (here is this game's corresponding Wikibase item page)
    3. Infobox templates deployed to generate infoboxes on each Mediawiki page:
      1. For a play
      2. For a drive
      3. For a game
    4. Example SPARQL queries for retrieving plays from the 2017 Alabama Crimson Tide football season meeting a variety of query criteria (PLEASE NOTE: To run each query, click on the Blue Arrow icon in lower left portion of screen after clicking on links below):
      1. All rushing touchdowns that went for over 50 yards during the 2017 Alabama Crimson Tide football season (limit to those for which there are video clips)
      2. All Jalen Hurts touchdown passes that went for over 25 yards in games from the 2017 Alabama Crimson tide football season (Just those Jalen Hurts TD passes for which there are video clips).
      3. All touchdown passes that went for over 10 yards in the 2017 Alabama Crimson Tide football season (Just those plays for which there are video clips).
  2. Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays:
    1. Tua Tagovailoa pass complete to DeVonta Smith for 27 yds for a TD
    2. Da'Ron Payne 1 Yd pass from Jalen Hurts
    3. Damien Harris run for 75 yds for a TD
    4. Calvin Ridley 12 Yd pass from Jalen Hurts
    5. See also: Plays with Example UA Images May 2019
  3. Collaborator information. Important collaborators contributing to the research reported here:
    1. Dr. Greg Bott, Assistant Professor, UA Culverhouse School of Business. Dr. Bott is co-PI on the RGC grant providing data management expertise focusing on optimizing the efficiency of data wrangling methods using Python scripting
    2. David McMillan, IT Team Leader, Enterprise Development & Application Support, UA Office of Information Technology (OIT). David has been a long time collaborator beginning in the late 1990s when he was Systems Admin in the School of Library and Information Studies, and we were co-authors on a UA patent. In the current research project, David has contributed crucial support in the installation, optimization, and ongoing management of the Mediawiki/Wikibase instance hosted by UA OIT.
    3. Huapu Liu, Graduate Research Assistant and MLIS student. Huapu served as my graduate research assistant for the entire 2018-19 academic year serving as an indispensable collaborator in the development of our understanding of Wikibase and linked data, which was essentially a long series of trial and error steps. Huapu helped me compose over 25 pages of data wrangling procedures needed to transform the JSON-encoded statistical play-by-play datasets into linked data in Wikibase. He also did more than his share of data wrangling! (to run SPARQL query, click on the Blue Arrow icon in lower left portion of screen after clicking on link)
    4. Christine Schultz-Richert, MLIS student. Christine joined the research team in early April after hearing about this project in the presentation I made about it in our LS 566 Metadata course. In this short amount of time, Christine was able to accomplish quite a bit of wrangling data for us.