Difference between revisions of "2018-19 Academic Year Research Report"
Jump to navigation
Jump to search
(18 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | <h1><center>Data-driven Semantic Indexing of Football Images:<br> | + | <h1><center>Data-driven Semantic Indexing of Football Images from the Entire 2017 Alabama Crimson Tide Football Season:<br> |
An Experiment in Linked Data Using a Local Wikibase Instance</center></h1><br> | An Experiment in Linked Data Using a Local Wikibase Instance</center></h1><br> | ||
− | + | <b>[https://wikibase.slis.ua.edu/ Linked Data Research Group]</b><br> | |
− | + | <b>[https://slis.ua.edu/ School of Library and Information Studies]</b><br> | |
− | + | <b>[https://cis.ua.edu/ College of Communication and Information Sciences]</b><br> | |
− | + | University of Alabama<br> | |
− | + | <br> | |
Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and [https://wikibase.slis.ua.edu/wiki/Plays_with_Example_UA_Images_May_2019 identification] process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software. | Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and [https://wikibase.slis.ua.edu/wiki/Plays_with_Example_UA_Images_May_2019 identification] process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software. | ||
Line 13: | Line 13: | ||
Outline of research accomplishments resulting from this experiment: | Outline of research accomplishments resulting from this experiment: | ||
− | # '''<u>Progress Report: RGC grant-funded research accomplishments thus far.</u>''' We were able to successfully transform [https:// | + | # '''<u>Progress Report: RGC grant-funded research accomplishments thus far.</u>''' We were able to successfully transform [https://api.collegefootballdata.com/api/docs/?url=/api-docs.json Web-accessible] JSON-encoded statistical play-by-play datasets for all 14 games in the 2017 Alabama Crimson Tide football season into a [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team linked data application] using Wikibase. '''One can now navigate from play to play, drive to drive, and game to game within the [https://wikibase.slis.ua.edu/wiki/2017_Alabama_Crimson_Tide_football_team 2017 football season] using data drawn from existing statistical play-by-play datasets incorporated into the application by way of linked data methods.''' Highlights: |
− | |||
## [https://wikibase.slis.ua.edu/w/index.php?title=Special:ListProperties/&limit=100&offset=0 Property list] for the ontology developed for this application. | ## [https://wikibase.slis.ua.edu/w/index.php?title=Special:ListProperties/&limit=100&offset=0 Property list] for the ontology developed for this application. | ||
## Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology): | ## Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology): | ||
Line 24: | Line 23: | ||
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_drive&action=edit a drive] | ### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_drive&action=edit a drive] | ||
### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_game&action=edit a game] | ### For [https://wikibase.slis.ua.edu/w/index.php?title=Template:Infobox_game&action=edit a game] | ||
− | ## Example SPARQL queries for retrieving | + | ## Example SPARQL queries for retrieving plays from the 2017 Alabama Crimson Tide football season meeting a variety of query criteria ('''PLEASE NOTE: To run each query, click on the Blue Arrow icon in lower left portion of screen after clicking on links below'''): |
− | ### [https://tinyurl.com/ | + | ### [https://tinyurl.com/vut7upm All rushing touchdowns that went for over 50 yards] during the 2017 Alabama Crimson Tide football season (limit to those for which there are [https://tinyurl.com/tmthgch video clips]) |
− | ### [https:// | + | ### [https://tinyurl.com/ws9xknu All Jalen Hurts touchdown passes that went for over 25 yards] in games from the 2017 Alabama Crimson tide football season (Just those Jalen Hurts TD passes for which there are [https://tinyurl.com/vsnvejs video clips]). |
− | + | ### [https://tinyurl.com/yx5od34s All touchdown passes that went for over 10 yards] in the 2017 Alabama Crimson Tide football season (Just those plays for which there are [https://tinyurl.com/vxxdfzy video clips]). | |
− | |||
− | |||
# '''<u>Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets</u>'''. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays: | # '''<u>Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets</u>'''. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays: | ||
## [https://wikibase.slis.ua.edu/wiki/Tua_Tagovailoa_pass_complete_to_DeVonta_Smith_for_27_yds_for_a_TD_(Andy_Pappanastos_KICK)_(9/23/17) Tua Tagovailoa pass complete to DeVonta Smith for 27 yds for a TD] | ## [https://wikibase.slis.ua.edu/wiki/Tua_Tagovailoa_pass_complete_to_DeVonta_Smith_for_27_yds_for_a_TD_(Andy_Pappanastos_KICK)_(9/23/17) Tua Tagovailoa pass complete to DeVonta Smith for 27 yds for a TD] |
Latest revision as of 14:15, 26 October 2022
Data-driven Semantic Indexing of Football Images from the Entire 2017 Alabama Crimson Tide Football Season:
An Experiment in Linked Data Using a Local Wikibase Instance
An Experiment in Linked Data Using a Local Wikibase Instance
Linked Data Research Group
School of Library and Information Studies
College of Communication and Information Sciences
University of Alabama
Our aim with this experiment was to demonstrate how statistical play-by-play data generated during a football game could be incorporated into a semantic indexing and identification process for the photos and video clips captured during those games. We chose to deploy a linked data approach using Wikibase as our software.
Outline of research accomplishments resulting from this experiment:
- Progress Report: RGC grant-funded research accomplishments thus far. We were able to successfully transform Web-accessible JSON-encoded statistical play-by-play datasets for all 14 games in the 2017 Alabama Crimson Tide football season into a linked data application using Wikibase. One can now navigate from play to play, drive to drive, and game to game within the 2017 football season using data drawn from existing statistical play-by-play datasets incorporated into the application by way of linked data methods. Highlights:
- Property list for the ontology developed for this application.
- Examples of three types of entities ("play" "drive" "game") in our linked data application. Each example provides links to Mediawiki site pages and to Wikibase item pages (each item page contains the metadata statements (each statement a "triple") about each entity using properties drawn from our ontology):
- Mediawiki site page for a typical play (here is this play's corresponding Wikibase item page)
- Mediawiki site page for a typical drive (here is this drive's corresponding Wikibase item page)
- Mediawiki site page for a typical game (here is this game's corresponding Wikibase item page)
- Infobox templates deployed to generate infoboxes on each Mediawiki page:
- Example SPARQL queries for retrieving plays from the 2017 Alabama Crimson Tide football season meeting a variety of query criteria (PLEASE NOTE: To run each query, click on the Blue Arrow icon in lower left portion of screen after clicking on links below):
- All rushing touchdowns that went for over 50 yards during the 2017 Alabama Crimson Tide football season (limit to those for which there are video clips)
- All Jalen Hurts touchdown passes that went for over 25 yards in games from the 2017 Alabama Crimson tide football season (Just those Jalen Hurts TD passes for which there are video clips).
- All touchdown passes that went for over 10 yards in the 2017 Alabama Crimson Tide football season (Just those plays for which there are video clips).
- Applied research results (i.e., outside of the RGC grant context): The use of linked data to provide access to multimedia assets. One important result of our research work during this academic year is the demonstration of the use of linked data to provide access to multimedia assets that document individual plays. As shown above, individual plays can be discovered by navigating the linked data application or by using SPARQL to query the triplestore. In the examples below, you will find video clips for each play thus demonstrating how linked data navigation or SPARQL querying methods can lead to multimedia assets that document those plays:
- Collaborator information. Important collaborators contributing to the research reported here:
- Dr. Greg Bott, Assistant Professor, UA Culverhouse School of Business. Dr. Bott is co-PI on the RGC grant providing data management expertise focusing on optimizing the efficiency of data wrangling methods using Python scripting
- David McMillan, IT Team Leader, Enterprise Development & Application Support, UA Office of Information Technology (OIT). David has been a long time collaborator beginning in the late 1990s when he was Systems Admin in the School of Library and Information Studies, and we were co-authors on a UA patent. In the current research project, David has contributed crucial support in the installation, optimization, and ongoing management of the Mediawiki/Wikibase instance hosted by UA OIT.
- Huapu Liu, Graduate Research Assistant and MLIS student. Huapu served as my graduate research assistant for the entire 2018-19 academic year serving as an indispensable collaborator in the development of our understanding of Wikibase and linked data, which was essentially a long series of trial and error steps. Huapu helped me compose over 25 pages of data wrangling procedures needed to transform the JSON-encoded statistical play-by-play datasets into linked data in Wikibase. He also did more than his share of data wrangling! (to run SPARQL query, click on the Blue Arrow icon in lower left portion of screen after clicking on link)
- Christine Schultz-Richert, MLIS student. Christine joined the research team in early April after hearing about this project in the presentation I made about it in our LS 566 Metadata course. In this short amount of time, Christine was able to accomplish quite a bit of wrangling data for us.