About Me

Hi, I’m Leslie! I’m a digital collections librarian specializing in metadata and data curation, with an MLIS from the University of Washington. I love finding ways to make resources more discoverable and accessible through metadata design, linked data approaches, and computational methods.
I started in digital collections on the University of Oregon Libraries’ Oregon Digital team, where I supported metadata creation and large-scale remediation projects. Since then, I’ve been exploring how computational methods can help us work with information at scale across different contexts. I’ve worked with cultural heritage materials, bibliographic data, and research datasets, with recent projects involving bibliographic analysis, linked open data work, and web archive research.
I’m drawn to work that improves how people find and use information, combining thoughtful metadata practices with emerging technologies and sustainable approaches. For a closer look at my background, check out my resume and the Projects section below.
Contact: ✉️ leslieharka@gmail.com
Projects
Selected projects in metadata work, digital curation, and computational analysis.
Analyzing ISMIR Metadata
(2024–2025)
Tools:
Python, SPARQL, RDF, Data Extraction
Links:
Project Site
Embedded Spreadsheets in Web Archives
(2025)
Current project developing methods to detect spreadsheets in End of Term Web Archive PDFs using multimodal models and computer vision. Published tutorial on accessing parquet data on AWS S3.
Tools:
Python, SQL, AWS S3, Web Archives, Data Extraction
Links:
Tutorial
Archival Processing Projects
(2021–2022)
Processed archival collections and created accession records and finding aids in ArchivesSpace during UO Special Collections Thomas Internship. Selected examples shared in the links below.
Tools:
ArchivesSpace, Metadata, Digital Archives, EAD
Links:
123
Oregon Digital Migration
(2021–2022)
Metadata auditing and remediation for thousands of cultural heritage materials during Oregon Digital platform migration to Samvera/Hyrax.
Tools:
Samvera/Hyrax, Metadata, Digital Archives
Links:
RepositoryLive Site
Doris Ulmann Spotlight Exhibit
(2020–2021)
Remediated metadata for 2,400+ Doris Ulmann photographs (early 20th-century American South). Created public digital exhibit in Spotlight.
Tools:
Spotlight, Metadata, Digital Archives, Exhibits
Links:
CollectionExhibitBlog Post