What are TDM platforms?
TDM Studio Workbench
TDM Studio Visualization
Note: If you prefer to run analyses on your own computer using your own code, you can download up to 5000 documents at a time.
The LexisNexis Web Services API (WSAPI) is a subscription service that enables researchers to download and build text corpuses from the Nexis Uni subscribed collection for further analysis. The WSAPI is provided by the UC Berkeley Library.
Tool limitations:
Searches will be scheduled to run over the weekend to minimize service disruption.
May not initiate more than 749 searches per hour nor retrieve more than 3000 documents at a time.
Skills required to use the API:
Please contact consultants at the D-Lab if you would like assistance with any of the below requirements.
Knowledge of R or Python, JSON, and XML
Familiarity with API calls
Please visit the LexisNexis Web Services API guide for more information.
HathiTrust Data Capsules are secure virtual environments for non-consumptive text analysis, where researchers can implement their own data analysis and visualization tools.
In other words, you log into a virtual machine where you will have access to OCRed texts from the HathiTrust Digital Library. You can run your own analyses on this data. You export your results, but not the corpus itself.
Anyone can use the data capsule and work with public domain materials. In addition, since UC Berkeley is a HathiTrust member, UC Berkeley researchers can include in their corpus material still in copyright.