Text Creation PartnershipStandardized, accurate, and faithful XML/SGML-encoded electronic text editions of early printed books. We’ve transcribed and marked up text — through manual keying, rather than optical character recognition (OCR) — from millions of static page images in ProQuest’s Early English Books Online, Gale Cengage’s Eighteenth Century Collections Online, and Readex’s Evans Early American Imprints. Raw transcripts are available for bulk download as zipped files for those wishing to do text mining or similar projects.
https://textcreationpartnership.org/faq/#faq05standardized, accurate, and faithful XML/SGML-encoded electronic text editions of early printed books. We’ve transcribed and marked up text — through manual keying, rather than optical character recognition (OCR) — from millions of static page images in ProQuest’s Early English Books Online, Gale Cengage’s Eighteenth Century Collections Online, and Readex’s Evans Early American Imprints. Raw transcripts are available for bulk download as zipped files for those wishing to do text mining or similar projects.