Crowdsourcing based curation and user engagement in digital library design

Holley, Rose. Crowdsourcing based curation and user engagement in digital library design, 2017. In UNESCO-National Digital Library of India International Workshop ‘Knowledge Engineering for Digital Library Design’, New Delhi (India), 23-25 October 2017. [Presentation]

English abstract

A historical perspective on the development and success of Trove, including the original concept, the user-centric design principles and social engagement with thousands of volunteers. The Trove service, which is now ten years old, is used by millions of Australians, with the digitised Australian newspapers being the most popular resource. Rose Holley, Special Collections Curator at UNSW Canberra, discusses the findings of her research into crowdsourcing based curation. Using the digitised historic Australian Newspapers as an example, she looks at how the functionality and interface were developed in close relationship with the users, and how this led to text correction of newspaper articles. It is nearly ten years since this pioneering project began, and the motivations and achievements of the 50,000 volunteers over this time are examined. She questions how successfully the goal of improving text quality, and therefore search, has been achieved. She proposes that if a similar project were begun now, artificial intelligence software such as the OverProof post-OCR correction tool would be used to improve the quality of the text. OverProof has been trained on the manual corrections of the Australian newspaper corpus, and trials demonstrate that it is able to dramatically improve the quality of the corpus. Volunteer text correction could still continue afterwards for difficult text, but the software would do the main donkey work, giving users a better quality search.

Item type: Presentation
Keywords: Trove, crowdsourcing, OverProof, artificial intelligence, newspapers, libraries, digital libraries, transcription, Australian newspapers, digitised newspapers
Subjects: C. Users, literacy and reading. > CB. User studies.
L. Information technology and library technology > LC. Internet, including WWW.
L. Information technology and library technology > LJ. Software.
L. Information technology and library technology > LK. Software methodologies and engineering.
L. Information technology and library technology > LS. Search engines.
Depositing user: Rose Holley
Date deposited: 03 Sep 2018 06:31
Last modified: 03 Sep 2018 06:31

