What are TDM platforms?
TDM Studio Workbench
TDM Studio Visualization
Note: If you prefer to run analyses on your own computer using your own code, you can download up to 5000 documents at a time.
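If your corpus is larger than the per-download limit, you will need to split it into several downloads. The following is a minimal sketch of that splitting step only, assuming you already have a list of document identifiers; the `corpus_ids` list and the `batches` helper are hypothetical names for illustration, and the download step itself is not shown because it depends on the platform.

```python
from typing import Iterator, Sequence

BATCH_LIMIT = 5000  # per-download cap stated in the note above


def batches(doc_ids: Sequence[str], size: int = BATCH_LIMIT) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of doc_ids, each at most `size` items long."""
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(doc_ids), size):
        yield doc_ids[start:start + size]


# Hypothetical corpus of 12,000 document identifiers (placeholder values).
corpus_ids = [f"doc-{i}" for i in range(12_000)]

batch_sizes = [len(batch) for batch in batches(corpus_ids)]
print(batch_sizes)  # [5000, 5000, 2000]
```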
Nexis Data Lab (LexisNexis) lets you conduct TDM research on LexisNexis-licensed materials that the Library subscribes to. You run your analyses in Jupyter notebooks using either Python or R. Access to Nexis Data Lab is provided by the Library.
HathiTrust Data Capsules are secure virtual environments for non-consumptive text analysis, where researchers can implement their own data analysis and visualization tools.
In other words, you log into a virtual machine that gives you access to OCRed texts from the HathiTrust Digital Library. You can run your own analyses on this data and export your results, but not the corpus itself.
Anyone can use a Data Capsule to work with public domain materials. In addition, because UC Berkeley is a HathiTrust member, UC Berkeley researchers can also include in-copyright material in their corpus.
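To illustrate what "export your results, but not the corpus" can look like in practice, here is a minimal sketch in Python. It assumes the texts are available as plain `.txt` files in a single directory, which may not match how a given capsule stores its data. The function names `count_words` and `export_counts` are hypothetical, not part of any HathiTrust tool. Only aggregate word counts are written out; no passages of the source text leave the environment.

```python
import csv
import re
from collections import Counter
from pathlib import Path

# Simple tokenizer: runs of ASCII letters only (an assumption; adjust for your corpus).
TOKEN = re.compile(r"[a-z]+")


def count_words(corpus_dir: Path) -> Counter:
    """Count word occurrences across every .txt file in corpus_dir."""
    counts = Counter()
    for path in sorted(corpus_dir.glob("*.txt")):
        text = path.read_text(encoding="utf-8", errors="replace").lower()
        counts.update(TOKEN.findall(text))
    return counts


def export_counts(counts: Counter, out_path: Path, top_n: int = 100) -> None:
    """Write the top_n most frequent words to a CSV file: derived results only."""
    with out_path.open("w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow(["word", "count"])
        writer.writerows(counts.most_common(top_n))
```

You would call `count_words` on the directory holding your corpus and pass the result to `export_counts` with an output path of your choosing. The resulting CSV contains only word frequencies, the kind of derived output that non-consumptive research is meant to produce.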