Follow us on Facebook to receive important updates Follow us on Twitter to receive important updates Follow us on sina.com's microblogging site to receive important updates Follow us on Douban to receive important updates
Chinese Text Project
Discussion -> Latest updates -> OCR integration and library linking: over ten million pages of pre-modern Chinese texts now searchable online

2015-03-24 12:16:27OCR integration and library linking: over ten million pages of pre-modern Chinese texts now searchable online
Posted by: admin (CTP Admin)A major update to the site has been made by applying OCR to over ten million pages of transmitted texts stored in the Library, linking scanned texts where possible to digital editions that follow them. Over 3000 existing texts have been successfully linked, allowing side-by-side display and textual searching of scanned texts.

Additionally, around ten thousand new texts and editions have also been transcribed for the first time using OCR. While these transcriptions inevitably contain many errors, they make it possible for the first time to search the scanned texts and immediately locate information within them. All newly transcribed texts have been added to the Wiki - please help by correcting errors when using these resources.

For further details, please see the OCR instructions.



To participate in the discussion, please log in to your CTP account using the form below. If you don't yet have an account, click here to set one up.

Log in
Username:
Password:
Keep me logged in
Forgotten password

Enjoy this site? Please help.Site design and content copyright 2006-2024. When quoting or citing information from this site, please link to the corresponding page or to https://ctext.org. Please note that the use of automatic download software on this site is strictly prohibited, and that users of such software are automatically banned without warning to save bandwidth. 沪ICP备09015720号-3Comments? Suggestions? Please raise them here.