Abstract

With the rapid development of the digital humanities (DH) field, demands for historical and cultural heritage data have generated deep interest the data provided by libraries, archives, and museums (LAMs). In order to enhance LAM data’s quality and discoverability while enabling a self-sustaining ecosystem, “semantic enrichment” becomes a strategy increasingly used by LAMs during recent years. This article introduces a number of semantic enrichment methods and efforts that can be applied to LAM data at various levels, aiming to support deeper and wider exploration and use of LAM data in DH research. The real cases, research projects, experiments, and pilot studies shared in this article demonstrate endless potential for LAM data, whether they are structured, semi-structured, or unstructured, regardless of what types of original artifacts carry the data. Following their roadmaps would encourage more effective initiatives and strengthen this effort to maximize LAM data’s discoverability, use- and reuse-ability, and their value in the mainstream of DH and Semantic Web.

Full document

The PDF file did not load properly or your web browser does not support viewing PDF files. Download directly to your device: Download PDF document

References

Albritton, Benjamin (2013). Digital manuscript interoperability: Shared canvas and IIIF in practice. https://slideplayer.com/slide/5840185

Alemu, Getaneh; Brett, Stevens; Ross, Penny; Chandler, Jane (2012). “Linked data for libraries: Benefits of a conceptual shift from library-specific record structures to RDF-based data models”. New library world, v. 113, n. 11/12, pp. 549-570. https://doi.org/10.1108/03074801211282920

Allen, Robert B. (2017). “Rich semantics and direct representation for digital collections”. In: ACM/IEEE Joint conference on digital libraries (JDCL), pp. 348-349. https://doi.org/10.1109/JCDL.2017.7991623

Appleby, Michael; Crane, Tom; Sanderson, Robert; Stroop, Jon; Warnet, Simeon (2012a). “IIIF Image API 2.1.1”. IIIF. https://iiif.io/api/image/2.1

Appleby, Michael; Crane, Tom; Sanderson, Robert; Stroop, Jon; Warnet, Simeon (2012b). “IIIF Presentation API 2.1.1”. IIIF. https://iiif.io/api/presentation/2.1

Bainbridge, David; Hinze, Annika; Cunningham, Sally-Jo; Downie, J. Stephen (2016). “Low-cost semantic enhancement to digital library metadata and indexing: Simple yet effective strategies”. In: 2016 ACM/IEEE Joint conference on digital libraries (JDCL), pp. 93-102. https://core.ac.uk/download/pdf/44290466.pdf

Baker, Thomas; Bermès, Emmanuelle; Coyle, Karen; Dunsire, Gordon; Isaac, Antoine; Murray, Peter; Panzer, Michael; Schneider, Jodi; Singer, Ross; Summers, Ed; Waites, William; Young, Jeff; Zeng, Marcia Lei (2011). Library linked data Incubator Group Final Report. W3C Incubator Group Report 25. http://www.w3.org/2005/Incubator/lld/XGR-lld-20111025

Bensmann, Felix; Zapilko, Benjamin; Mayr, Philipp (2017). “Interlinking large-scale library data with authority records”. Frontiers in digital humanities, n. 4, p. 5. https://doi.org/10.3389/fdigh.2017.00005

Borgman, Christine L. (2015). Big data, little data, no data: Scholarship in the networked world. Cambridge, MA: MIT Press. ISBN: 978 0 262529914

Burdick, Anne; Drucker, Johanna; Lunenfeld, Peter; Presner, Todd; Schnapp, Jeffrey (2012). Digital_Humanities. Cambridge, MA: MIT Press. ISBN: 978 0 262528863

Clarke, David (2015). “Deep image annotation: Making a difference in knowledge organization”. Fourth ISKO-UK Biennial conference of the International Society for Knowledge Organization. http://docplayer.net/13812285-Deep-image-annotation-making-a-difference-in-knowledge-organization.html

Consultative Committee for Space Data Systems (2012). Reference model for an Open Archival Information System. Washington DC: CCSDS. https://public.ccsds.org/Pubs/650x0m2.pdf

Damjanovic, Violeta; Kurz, Thomas; Westenthaler, Rupert; Behrendt, Wernher; Gruber, Andreas; Schaffert, Sebastian (2011). “Semantic enhancement: The key to massive and heterogeneous data pools”. In: Proceedings of the 20th intl IEEE ERK (Electrotechnical and Computer Science) conference, pp. 413-416. https://www.researchgate.net/publication/266603290_Semantic_Enhancement_The_Key_to_Massive_and_Heterogeneous_Data_Pools

Dunsire, Gordon; Willer, Mirna (2011). “Standard library metadata models and structures for the semantic web”. Library hi tech news, v. 28, n. 3, pp. 1-12. https://doi.org/10.1108/07419051111145118

Farias-Lóscio, Bernadette; Burle, Caroline; Calegari, Newton (2017). Data on the web best practices. W3C Recommendation 31 January 2017. http://www.w3.org/TR/dwbp

Floridi, Luciano (2010). Information: A very short introduction. Oxford: Oxford University Press. ISBN: 978 0 199551378

Gardner, Dan (2012). “An ocean of data [Introduction]”. In: Smolan, Rick; Erwitt, Jennifer (eds.). The human face of big data. Sausalito, CA: Against All Odds Productions, pp. 14-17. ISBN: 978 1 454908272

Gracy, Karen; Davidson, Sammy (2014). “Helping users find the ‘good stuff’: Using the semantic analysis method (SAM) tool to identify and extract potential access points from archival finding aids”. In: SAA Research Forum, Society of American Archivists. http://files.archivists.org/pubs/proceedings/ResearchForum/2014/posters/GracyDavidson-ResearchForumPoster2014.pdf

Gracy, Karen; Zeng, Marcia Lei (2015). “Creating linked data within archival description: Tools for extracting, validating, and encoding access points for finding aids”. Digital humanities conference of the Alliance of Digital Humanities Organizations (ADHO).

Gracy, Karen; Zeng, Marcia Lei; Skirvin, Laurence (2013). “Exploring methods to improve access to music resources by aligning library data with linked data: A report of methodologies and preliminary findings”. Journal of the American Society for Information Science and Technology (JASIS&T), v. 64, n. 10, pp. 2078-2099. https://doi.org/10.1002/asi.22914

Gruber, Ethan (2017). “Final report to the NEH for online coins of the Roman Empire”. Day of archaeology, July 28. http://www.dayofarchaeology.com/final-report-to-the-neh-for-online-coins-of-the-roman-empire

Hinze, Annika; Taube-Schock, Craig; Bainbridge, David; Matamua, Rangi; Downie, J. Stephen (2015). “Improving access to large-scale digital libraries through semantic-enhanced search and disambiguation”. In: Proceedings of the 15th ACM/IEEE-CS Joint conference on digital libraries. Association for Computational Linguistics, pp. 147-156. https://doi.org/10.1145/2756406.2756920

Hyvönen, Eero (2016). “Cultural heritage linked data on the semantic web: Three case studies using the sampo model”. VIII Encounter of documentation centres of contemporary art: open linked data and integral management of information in cultural centres. Artium, Vitoria-Gasteiz, Spain, October 19-20. https://seco.cs.aalto.fi/publications/submitted/hyvonen-vitoria-2017.pdf

Hyvönen, Eero; Heino, Erkki; Leskinen, Petri; Ikkala, Esko; Koho, Mikko; Tamper, Minna; Tuominen, Jouni; Mäkelä, Eetu (2016). “Publishing Second World War history as linked data events on the semantic web”. In: Proceedings of the digital humanities conference, pp. 571-573. https://seco.cs.aalto.fi/publications/2016/hyvonen-et-al-warsa-dh2016.pdf

Hyvönen, Eero; Leskinen, Petri; Tamper, Minna; Rantala, Heikki; Tuominen, Jouni; Keravuori, Kirsi (2018). “Demonstrating BiographySampo in solving digital humanities research problems in biography and prosopography” [Submitted paper]. https://seco.cs.aalto.fi/publications/submitted/hyvonen-et-al-bs-2019.pdf

Ikkala, Esko; Tuominen, Jouni; Raunamaa, Jaakko; Aalto, Tiina; Ainiala, Terhi; Uusitalo, Helinä; Hyvönen, Eero (2018). “NameSampo: A linked open data infrastructure and workbench for toponomastic research”. In: GeoHumanities 18, Proceedings of the 2nd ACM SIG Spatial workshop on geospatial humanities, Seattle, WA, USA, November 06-09, pp. 2:1-2:9, ACM. https://doi.org/10.1145/3282933.3282936

IMLS (2018). Transforming communities: IMLS strategic plan (2018-2022). Washington DC: Institute of Museum and Library Services. https://www.imls.gov/sites/default/files/publications/documents/imls-strategic-plan-2018-2022.pdf

Isaac, Antoine; Manguinhas, Hugo; Stiller, Juliane; Charles, Valentine (2015). Report on enrichment and evaluation. The Hague, Netherlands: Europeana Task Force on Enrichment and Evaluation. http://pro.europeana.eu/files/Europeana_Professional/EuropeanaTech/EuropeanaTech_taskforces/Enrichment_Evaluation/FinalReport_EnrichmentEvaluation_102015.pdf

Kaplan, Frederic (2015). “A map for big data research in digital humanities”. Frontiers in digital humanities, n. 2. https://doi.org/10.3389/fdigh.2015.00001

KBpedia (2018). KBpedia is now open source, October 23. http://kbpedia.org/resources/news/kbpedia-is-open-source

Kobielus, James (2016). “The evolution of big data to smart data”. In: Smart data online, July 13 [PowerPoint slides]. http://smartdata2016.dataversity.net

Lin, Yuri; Michel, Jean-Baptiste; Lieberman-Aiden, Erez; Orwant, Jon; Brockman, Will; Petrov, Slav (2012). “Syntactic annotations for the Google Books Ngram corpus”. In: Proceedings of the ACL 2012 System demonstrations. Association for Computational Linguistics, pp. 169-174. http://aclweb.org/anthology/P12-3029

Manguinhas, Hugo (ed.) (2016). Europeana semantic enrichment framework. Version 17, Nov. http://shorturl.at/pEIQ5

Mayer, Daniel (2011). Mainstream semantic enrichment [YouTube video]. December 26. http://www.youtube.com/watch?v=YVxvQ7UpqI0

Mayer-Schönberger, Viktor; Cukier, Kenneth (2013). Big data: A revolution that will transform how we live, work, and think. New York, NY: Eamon Dolan/Mariner Books. ISBN: 978 0 544227750

Mukerjee, Prithwis (2014). “Introduction to data science” [PowerPoint slides], January 12. http://www.slideshare.net/prithwis/01-intro2-datascienceyantrajaalblog

Mutuvi, Stephen; Doucet, Antoine; Odeo, Moses; Jatowt, Adam (2018). “Evaluating the impact of OCR errors on topic modeling”. In: Maturity and innovation in digital libraries. 20th Intl conf on Asia-Pacific digital libraries, ICADL 2018, Hamilton, New Zealand, November 19-22, Proceedings, pp. 3-14. ISBN: 978 3 030 04257 8

National Archives (2016). “Finding aid type”. The national archives catalog. https://www.archives.gov/research/catalog/lcdrg/elements/findingtype.html

Nguyen, Thi-Tuyet-Hai; Coustaty, Mickael; Doucet, Antoine; Jatowt, Adam; Nguyen, Nhu-Van (2018). “Adaptive edit-distance and regression approach for post-OCR text correction”. In: Maturity and innovation in digital libraries. 20th Intl conf on Asia-Pacific digital libraries, ICADL 2018, Hamilton, New Zealand, November 19-22, Proceedings, pp. 278-289. ISBN: 978 3 030 04257 8

O’Neill, Ed; Mixter, Jeff (2013). “Maximizing the usage of value vocabularies in the linked data ecosystem”. In: 76th Annual meeting of the American Society for Information Science and Technology (ASIS&T), Montreal, Canada, November. http://nkos.slis.kent.edu/ASIST2013/ONeill-Mixter.pptx

Pattuelli, M. Cristina (2012). “Personal name vocabularies as linked open data: A case study of jazz artist names”. Journal of information science, v. 38, n. 6, pp. 558-565. https://doi.org/10.1177/0165551512455989

Pattuelli, M. Cristina; Hwang, Karen; Miller, Matthew (2016). “Accidental discovery, intentional inquiry: Leveraging linked data to uncover the women of jazz”. Digital scholarship in the humanities, v. 32, n. 4, pp. 918-924. https://doi.org/10.1093/llc/fqw0

Prasad, A. R. D.; Giunchiglia, Fausto; Devika, P. Madalli (2017). “DERA: from document centric to entity centric knowledge modelling”. In: Proceedings of the International UDC seminar 2017. Faceted classification today. London, September, pp. 169-179. http://seminar.udcc.org

Riva, Pat; LeBoeuf, Patrick; Žumer, Maja (2017). IFLA library reference model: A conceptual model for bibliographic information. Netherlands: IFLA. https://www.ifla.org/files/assets/cataloguing/frbr-lrm/ifla-lrm-august-2017.pdf

Schöch, Christof (2013). “Big? Smart? Clean? Messy? Data in the Humanities”. Journal of digital humanities, v. 2, n. 3, pp. 2-13. http://journalofdigitalhumanities.org/2-3/big-smart-clean-messy-data-in-the-humanities

Smith-Yoshimura, Karen (2018). “The rise of Wikidata as a linked data source”. In: Hanging together. The OCLC research blog, August 6. http://hangingtogether.org/?p=6775

Stiller, Juliane; Petras, Vivien; Gäde, Maria; Isaac, Antoine (2014). “Automatic enrichments with controlled vocabularies in Europeana: Challenges and consequences.” In: Euro-Mediterranean conf., pp. 238-247. Springer, Cham. https://doi.org/10.1007/978-3-319-13695-0_23

Svensson, Patrik (2010). “The landscape of digital humanities”. Digital humanities quarterly, v. 4, n. 1. http://digitalhumanities.org/dhq/vol/4/1/000080/000080.html

Thorsen, Hilary K.; Pattuelli, M. Cristina (2016). “Linked open data and the cultural heritage landscape”. In: Jones, Ed; Seikel, Michele (eds.). Linked data for cultural heritage. Chicago, IL: Alcts Publishing. ISBN: 978 1 783301621

TiECON East (2014). Data is the new oil. https://tieconeast.wordpress.com/page/2

Reinhard, Andrew; Van-Alfen, Peter; Bransbourg, Gilles; Gruber, Ethan; (2017). “Wishes granted: the ANS and the NEH”. In: National Endowment for the Humanities. Announces. New grant recipients. http://numismatics.org/pocketchange/wp-content/uploads/sites/3/NEH-Article-ANS-Magazine.pdf

Van-Ruyskensvelde, Sarah (2014). “Towards a history of e-ducation? Exploring the possibilities of digital humanities for the history of education”. Paedagogica historica, v. 50, n. 6, pp. 861-870. https://doi.org/10.1080/00309230.2014.955511

Varner, Stewart; Hswe, Patricia (2016). Special report: Digital humanities in libraries. American Libraries. https://americanlibrariesmagazine.org/2016/01/04/special-report-digital-humanities-libraries

W3C (2011). Library Linked Data Incubator Group Final Report https://www.w3.org/2005/Incubator/lld/XGR-lld-20111025

W3C (2017). Data on the Web best practices. https://www.w3.org/TR/dwbp

Wagner, Elisabeth; Matsumoto, Mallory; Kiel, Nikolai; Gronemeyer, Sven (2014). A checklist of museums with Maya Art. http://mayawoerterbuch.de/museumscollections

Wang, Xiaoguang; Liu, Xuemei; Xia, Shengping (2017). “Design and implementation of deep semantic indexing on digital cultural heritage images”. Journal of library and information science, v. 43, n. 1, pp. 98-121. http://jlis.glis.ntnu.edu.tw/ojs/index.php/jlis/article/view/716

Weitz, Jay; Toves, Jenny; Vizine-Goetz, Diane; Naught, Nannette; Bremer, Robert (2016). “Mining MARC’s hidden treasures: Initial investigations into how notes of the past might shape our future”. Journal of library metadata, v. 16, n. 3-4, pp. 166-180. https://doi.org/10.1080/19386389.2016.1262653

Zeng, Marcia Lei (2017). “Smart data for digital humanities”. Journal of data and information science, v. 2, n. 1, pp. 1-12. https://doi.org/10.1515/jdis-2017-0001

Zeng, Marcia Lei; Gracy, Karen; Skirvin, Laurence (2013). “Navigating the intersection of library bibliographic data and linked music information sources: A study in the identification of useful metadata elements for interlinking”. Journal of library metadata, v. 13, n. 2-3, pp. 254-278. https://doi.org/10.1080/19386389.2013.827513

Zeng, Marcia Lei; Gracy, Karen F.; Žumer, Maja (2014). “Using a semantic analysis tool to generate subject access points: A study using Panofsky’s theory and two research samples”. Knowledge organization, v. 41, n. 6, pp. 440-451. https://pdfs.semanticscholar.org/bbeb/42b931fd32520a03167770d2b5de694128e6.pdf

Zeng, Marcia Lei; Mayr, Philipp (2018). “Knowledge organization systems (KOS) in the semantic web”. International journal on digital libraries. https://doi.org/10.1007/s00799-018-0241-2

Žumer, Maja (2018). “IFLA library reference model (IFLA LRM): Harmonisation of the FRBR family”. Knowledge organization, v. 45, n. 4, pp. 310-318. Also available in Hjørland, Birger (ed.). ISKO Encyclopedia of knowledge organization. http://www.isko.org/cyclo/lrm

Žumer, Maja; Riva, Pat (2017). “IFLA LRM-Finally here”. In: Intl conf on Dublin Core and metadata applications, Washington, D.C., USA, 26-29 October, pp. 13-23. http://dcpapers.dublincore.org/pubs/article/download/3852/2037

Back to Top

Document information

Published on 07/01/19
Accepted on 07/01/19
Submitted on 07/01/19

Volume 28, Issue 1, 2019
DOI: 10.3145/epi.2019.ene.03
Licence: CC BY-NC-SA license

Document Score

0

Views 2
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?