Publications of Jérôme Darmont
J. Darmont, C. Favre, S. Loudcher, C. Noûs, "Data Lakes for Digital Humanities", 2nd International Digital Tools and Uses Congress (DTUC 2020), Hammamet, Tunisia, October 2020, 38-41; ACM, New York (Data and Digital Humanities Track - Video).
Traditional data in Digital Humanities projects bear various formats (structured, semi-structured, textual) and need substantial transformations (encoding and tagging, stemming, lemmatization, etc.) to be managed and analyzed. To fully master this process, we propose the use of data lakes as a solution to data siloing and big data variety problems. We describe data lake projects we currently run in close collaboration with researchers in humanities and social sciences and discuss the lessons learned running these projects.
Data lakes, Digital humanities, Metadata
[ BibTeX | XML | Full paper | Back ]