Pages tagged ocr:

John Resig - OCR and Neural Nets in JavaScript
http://ejohn.org/blog/ocr-and-neural-nets-in-javascript/

Canvas element used to do basic image-processing on an image. Can a JS port of NumPy be far behind? What about the effects on expectations for javascript from users and engine writers? *Mind buzzes*
Breaking Captchas with a GreaseMonkey script
Convert PDF to Word (DOC) Online — 100% Free!
http://www.pdftoword.com/
Convert PDF to Word
Using our PDF-to-Word conversion technology, you can quickly and easily create editable DOC/RTF files, making it a cinch to re-use PDF content in applications like Microsoft Word, Excel, OpenOffice, and WordPerfect. Best of all, it's entirely free!
OCR Terminal: Free Online OCR - Convert pdf to word, jpeg to word, scanned images to editable text
http://www.ocrterminal.com/
OCR Terminal is a free online Optical Character Recognition service that allows you to convert scanned images and PDFs into editable and text-searchable documents. It accurately preserves formatting and layout of documents.
Allows you to verify an e-mail verification token and finish the account creation procedure.
Free online OCR
http://www.free-ocr.com/
I have used it and it works. Limited to ten pages per hour, each 2 MB or less.
Prizmo 1.0 | Creaceed
http://www.creaceed.com/prizmo/
Lifehacker - Free OCR Converts Your Scanned Documents to Text - Scanner
http://lifehacker.com/5308284/free-ocr-converts-your-scanned-documents-to-text
Google Docs OCR
http://googlesystem.blogspot.com/2009/09/google-docs-ocr.html
To view
A free OCR service from Google; see also http://weocr.ocrgrid.org/
Google Docs API tests a new feature that lets you perform OCR (optical character recognition) on an image.
Perform OCR with Google Docs - Turn Scanned Images Into Editable Documents
http://www.labnol.org/internet/perform-ocr-with-google-docs/10059/
Google Docs successfully extracted all the text from a scanned book page
Turn Scanned Images Into Editable Documents
Uses the Open Source Tesseract OCR engine ( http://code.google.com/p/tesseract-ocr/ )
Google Docs can now perform OCR on digital images. You can upload an image containing typewritten or printed text (like a fax document or a scanned newspaper clipping) to your Google Docs account and it will turn that image into editable text.
Official Google Blog: Teaching computers to read: Google acquires reCAPTCHA
http://googleblog.blogspot.com/2009/09/teaching-computers-to-read-google.html
Teaching computers to read: Google acquires reCAPTCHA - http://bit.ly/mNdrd [from http://twitter.com/hadhad/statuses/4038838588]
Found this: Teaching computers to read: Google acquires reCAPTCHA: Shared by cec Wholly geek batman ... http://bit.ly/dTqYG [from http://twitter.com/kekil/statuses/4034854001]
"In this way, reCAPTCHA’s unique technology improves the process that converts scanned images into plain text, known as Optical Character Recognition (OCR). This technology also powers large scale text scanning projects like Google Books and Google News Archive Search. Having the text version of documents is important because plain text can be searched, easily rendered on mobile devices and displayed to visually impaired users. So we'll be applying the technology within Google not only to increase fraud and spam protection for Google products but also to improve our books and newspaper scanning process. That's why we're excited to welcome the reCAPTCHA team to Google, and we're committed to delivering the same high level of performance that websites using reCAPTCHA have come to expect. Improving the availability and accessibility of all the information on the Internet is really important to us, so we're looking forward to advancing this technology with the reCAPTCHA team."
I know I'm late to the game commenting on this one, but damn this kind of thing pisses me off. Can't we have just one thing that is cool on the internet without it getting acquired by Google or Yahoo? I'm not as anti-google as most, but all of a sudden reCAPTCHA feels exploitative. Brewster Kahle, where is the alternative for archive.org?
Reading: Teaching computers to read: Google acquires reCAPTCHA http://bit.ly/141R6p [from http://twitter.com/sandroalberti/statuses/4057584396]
Google acquire reCAPTCHA - teaching computers to read - http://bit.ly/IT1DZ [from http://twitter.com/nick_b/statuses/4050051801]
Google has acquired reCAPTCHA, a company that provides CAPTCHAs to help protect more than 100,000 websites from spam and fraud.
DocList API OCR Demo
http://googlecodesamples.com/docs/php/ocr.php
Currently at demo stage and linked to Google Docs
Google Docs can now perform OCR on digital images. You can upload an image containing typewritten or printed text (like a fax document or a scanned newspaper clipping) to your Google Docs account and it will turn that image into editable text.
To try out
Free Online OCR service - convert PDF documents to Word, JPG to Word
http://www.onlineocr.net/
Nice! This is pretty cool.
OnlineOCR Converts Your Scanned Documents to Editable Text - Conversion - Lifehacker
http://lifehacker.com/5380470/onlineocr-converts-your-scanned-documents-to-editable-text
Optical character recognition "Whether it's a page of printed notes from an instructor, an old proposal you want to edit, or a letter your boss wants turned into a template, OnlineOCR can help take an image of text and turn it into an editable copy."
Top 5 Free OCR Software Tools To Convert Images Into Text
http://www.makeuseof.com/tag/top-5-free-ocr-software-tools-to-convert-your-images-into-text-nb/
Free OCR
Free Online OCR - Convert JPEG, PNG, GIF, BMP, TIFF and PDF to Text
http://www.newocr.com/
NewOCR is a free online OCR service that converts scanned images and multi-page PDF documents to text, can process 29 languages, and supports layout analysis.
OCR online
About | Scan Tailor
http://scantailor.sourceforge.net/?q=en
Scan Tailor is an interactive post-processing tool for scanned pages. It performs operations such as page splitting, deskewing, adding/removing borders, and others. You give it raw scans, and you get pages ready to be printed or assembled into a PDF or DJVU file. Scanning, optical character recognition, and assembling multi-page documents are out of scope of this project. [09wk47]
Windows/Linux: Tired of fiddling with scanned pages to remove borders, correct alignment, and otherwise prettify them before storing or emailing them? Then Scan Tailor could be right up your alley. This free app splits two-page scans into single documents, converts text to black and white without disturbing images, and cleans stray specks off pages. Scan Tailor even gets rid of that pesky shadow down the center of a two-page scan that occurs when you lay an open book face down on a scanner.
Post-processing of scans
start [Bkrpr Wiki]
http://bkrpr.org/doku.php
Easy DIY book scanner.
An open source book scanner project.
"BookLiberator is a set of free software and hardware to digitize books: it lets you photograph all the pages in a book without harming the book. The resulting images can be processed with free, open source software to make user-friendly files in a variety of formats." Imagine a plexiglass box, two cameras, and a stand.
BookLiberator is a set of free software and hardware that helps you digitize books.
This is the home page and the documentation wiki of BookLiberator. BookLiberator is a set of free software and hardware that helps you digitize books. Some of our hardware designs make it easy to photograph all the pages in a book without harming the book. Other designs allow you to remove the binding so the pages can be dropped into a sheet-fed scanner. Whichever method you use, the resulting images can be processed with our software to make user-friendly files in a variety of formats.
Free your text, DIY-style!
book liberator -- diy book scanning device using two digital cameras
far simpler book scan design.
Megaupload auto-fill captcha for Greasemonkey
http://userscripts.org/scripts/show/38736
Firefox plugin that recognizes captchas, with links to the source code.
OCR in javascript
Auto-fills the megaupload captchas and optionally auto-starts download
neural-net OCR in javascript
WatchOCR
http://watchocr.com/
. Based on Knoppix, WatchOCR uses
WatchOCR is an open source OCR server that creates searchable pdfs from images in a watched folder.
Free OCR server for PDFs