Join us at Dr. Abdul Latiff Library at the Universiti Kebangsaan Malaysia (National University of Malaysia) on August 23, 2018 for a day-long text mining workshop for librarians hosted by the IFLA Social Science Libraries Satellite Meeting. 

This workshop will be led by instructors from the IMLS-funded project “Digging Deeper, Reaching Further: Libraries Empowering Users to Mine the HathiTrust Digital Library Resources” (DDRF). DDRF provides a workshop curriculum that teaches library and information professionals to concepts and methods in text mining and related digital scholarship methods, with a focus on the tools and data from the HathiTrust Research Center. The aims of the DDRF workshops are to empower librarians to become more conversant in digital scholarship and engage with digital projects at their institutions. 

Here are some of the exciting things you can expect to learn and become familiar with during this session:

  • Building a corpus of texts in a HTRC Workset, and using it to conduct text analysis on your collection of works;
  • Gathering data through web scraping;
  • Cleaning data, dirty OCR, and clean OCR;
  • Using Python for text mining;
  • Topic modeling and other approaches for text analysis.

This workshop will run from 9:00 a.m. to 4:00 p.m. with a break for lunch. All are encouraged to attend, and no experience is necessary!

Seating will be limited, so please register at:  

Updates and room directions will be sent to registrants just prior to the event date, and you can visit for information about the workshops. 

Please contact with any questions. 

This workshop event is funded in part by IMLS award #RE-00-15-0112-15.

Best wishes on behalf of the Social Science Library Section!

Lynne M. Rudasill Professor

University Library Global Studies Librarian University of Illinois at Urbana-Champaign 306 International Studies Building – MC 402 Champaign, Illinois 61820 217.265.6879 – Voice 217.333.6270 – FAX