光学文字認識とは何？わかりやすく解説 Weblio辞書

IT用語辞典バイナリ

索引トップ用語の索引ランキング画像一覧カテゴリー

OCR

フルスペル：Optical Character Recognition
読み方：オーシーアール
別名：光学文字認識

OCRとは、スキャナなどで入力された画像情報の中から、文字の形状に基づいて文字を識別し、コンピュータ上で扱える文字データへと変換する仕組みのことである。

OCRの機能を備えた装置やソフトウェアも、同じくOCRと呼ばれる。この場合のOCRはOptical Character Readerの略とされる。

書籍や新聞などの印刷物をスキャナで読み取ると、一面の画像として入力される。OCRでは、主にパターン認識の技術を用いて、画像中から文字情報を認識している。一般的には、スキャナから入力された画像をPC 上で専用のソフトウェアを利用して解析する方法が取られている。また、携帯電話の中にもOCRの機能が搭載された機種がある。

OCRを用いることで、例えば、古い書籍の情報を電子データ化する場合などに、タイピングによって人手で入力するよりも効率的に作業を進めることができる。

OCRは、あらかじめ登録された文字のパターンを参照して近似の形を判定するため、複雑な字形の漢字や、創作的な手書き文字を完全に正しく読み取ることは難しい。補助処理として、周囲の罫線などから文字の位置とつながりを確認する処理を行っている場合も多い。

プリンタ・スキャナのほかの用語一覧

スキャナ：

光学解像度光学式マーク読み取り装置ニポウディスクスキャナー OCR シートフィードスキャナスキャナ 3Dスキャン

>>スキャナカテゴリの他の用語

ウィキペディア

索引トップ用語の索引ランキングカテゴリー

光学文字認識

出典: フリー百科事典『ウィキペディア（Wikipedia）』 (2023/04/02 09:57 UTC 版)

光学文字認識（こうがくもじにんしき、英: Optical character recognition）は、活字、手書きテキストの画像を文字コードの列に変換するソフトウェアである。画像はイメージスキャナーや写真で取り込まれた文書、風景写真（風景内の看板の文字など）、画像内の字幕（テレビ放送画像内など）が使われる^[1]。一般にOCRと略記される。

脚注

注釈

^ カーツワイルは書体を選ばないOCR技術の発明者とされることもあるが、1960年代末ごろから同様の技術を開発する企業がいくつか出現している。詳しくは Schantz, The History of OCR; Data processing magazine, Volume 12 (1970), p. 46 を参照

出典

^ OnDemand, HPE Haven. “OCR Document”. 2016年4月15日時点のオリジナルよりアーカイブ。2016年4月15日閲覧。
^ ^a ^b Herbert Schantz, The History of OCR. Manchester Center, VT: Recognition Technologies Users Association, 1982.
^ "Reading Machine Speaks Out Loud" , February 1949, Popular Science.
^ Washington Daily News, April 27, 1951; New York Times, December 26, 1953
^ “音声ソフトの ScanSoft、競合する Nuance を買収”. japan.internet.com. (2005年5月10日)
^ Qing-An Zeng (28 October 2015). Wireless Communications, Networking and Applications: Proceedings of WCNA 2014. Springer. ISBN 978-81-322-2580-5
^ “Using OCR and Entity Extraction for LinkedIn Company Lookup” (2014年7月22日). 2016年4月17日時点のオリジナルよりアーカイブ。2017年6月16日閲覧。
^ “How To Crack Captchas”. andrewt.net (2006年6月28日). 2013年6月16日閲覧。
^ “Breaking a Visual CAPTCHA”. Cs.sfu.ca (2002年12月10日). 2013年6月16日閲覧。
^ John Resig (2009年1月23日). “John Resig – OCR and Neural Nets in JavaScript”. Ejohn.org. 2013年6月16日閲覧。
^ Tappert, C. C.; Suen, C. Y.; Wakahara, T. (1990). “The state of the art in online handwriting recognition”. IEEE Transactions on Pattern Analysis and Machine Intelligence 12 (8): 787. doi:10.1109/34.57669.
^ ^a ^b “Optical Character Recognition (OCR) – How it works”. Nicomsoft.com. 2013年6月16日閲覧。
^ Sezgin, Mehmet; Sankur, Bulent (2004). “Survey over image thresholding techniques and quantitative performance evaluation”. Journal of Electronic Imaging 13 (1): 146. Bibcode: 2004JEI....13..146S. doi:10.1117/1.1631315. オリジナルのOctober 16, 2015時点におけるアーカイブ。 2015年5月2日閲覧。.
^ Gupta, Maya R.; Jacobson, Nathaniel P.; Garcia, Eric K. (2007). “OCR binarisation and image pre-processing for searching historical documents.”. Pattern Recognition 40 (2): 389. doi:10.1016/j.patcog.2006.04.043. オリジナルのOctober 16, 2015時点におけるアーカイブ。 2015年5月2日閲覧。.
^ Trier, Oeivind Due; Jain, Anil K. (1995). “Goal-directed evaluation of binarisation methods.”. IEEE Transactions on Pattern Analysis and Machine Intelligence 17 (12): 1191–1201. doi:10.1109/34.476511 2015年5月2日閲覧。.
^ Milyaev, Sergey; Barinova, Olga; Novikova, Tatiana; Kohli, Pushmeet; Lempitsky, Victor (2013). “Image binarisation for end-to-end text understanding in natural images.”. Document Analysis and Recognition (ICDAR) 2013 12th International Conference on: 128–132. doi:10.1109/ICDAR.2013.33. ISBN 978-0-7695-4999-6 2015年5月2日閲覧。.
^ Pati, P.B.; Ramakrishnan, A.G. (1987-05-29). “Word Level Multi-script Identification”. Pattern Recognition Letters 29 (9): 1218–1229. doi:10.1016/j.patrec.2008.01.027.
^ “Basic OCR in OpenCV | Damiles”. Blog.damiles.com (2008年11月20日). 2013年6月16日閲覧。
^ ^a ^b ^c Ray Smith (2007年). “An Overview of the Tesseract OCR Engine”. 2010年9月28日時点のオリジナルよりアーカイブ。2013年5月23日閲覧。
^ “OCR Introduction”. Dataid.com. 2013年6月16日閲覧。
^ “How OCR Software Works”. OCRWizard. 2009年8月16日時点のオリジナルよりアーカイブ。2013年6月16日閲覧。
^ “The basic pattern recognition and classification with openCV | Damiles”. Blog.damiles.com (2008年11月14日). 2013年6月16日閲覧。
^ http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&p=1&u=%2Fnetahtml%2FPTO%2Fsearch-bool.html&r=1&f=G&l=50&co1=AND&d=PTXT&s1=10,679,089&OS=10,679,089&RS=10,679,089
^ ^a ^b ^c “How does OCR document scanning work?”. Explain that Stuff (2012年1月30日). 2013年6月16日閲覧。
^ “How to optimize results from the OCR API when extracting text from an image? - Haven OnDemand Developer Community”. 2016年3月22日時点のオリジナルよりアーカイブ。2020年12月21日閲覧。
^ Fehr, Tiff, How We Sped Through 900 Pages of Cohen Documents in Under 10 Minutes, Times Insider, The New York Times, March 26, 2019
^ “Train Your Tesseract”. Train Your Tesseract (2018年9月20日). 2018年9月20日閲覧。
^ “What is the point of an online interactive OCR text editor? - Fenno-Ugrica” (2014年2月21日). 2020年12月21日閲覧。
^ Riedl, C.; Zanibbi, R.; Hearst, M. A.; Zhu, S.; Menietti, M.; Crusan, J.; Metelsky, I.; Lakhani, K. (20 February 2016). “Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms”. International Journal on Document Analysis and Recognition 19 (2): 155. arXiv:1410.6751. doi:10.1007/s10032-016-0260-8.
^ “The Fifth Annual Test of OCR Accuracy”. 2012年4月27日閲覧。
^ Holley, Rose (2009年4月). “How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs”. D-Lib Magazine. 2011年1月5日閲覧。
^ Suen, C.Y., et al (1987-05-29). Future Challenges in Handwriting and Computer Applications. 3rd International Symposium on Handwriting and Computer Applications, Montreal, May 29, 1987 2008年10月3日閲覧。.
^ Tappert, Charles C., et al (1990-08). The State of the Art in On-line Handwriting Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol 12 No 8, August 1990, pp 787-ff 2008年10月3日閲覧。.

[続きの解説]

「光学文字認識」の続きの解説一覧

ウィキペディア小見出し辞書

索引トップ用語の索引ランキング

光学文字認識 (OCR)

出典: フリー百科事典『ウィキペディア（Wikipedia）』 (2022/02/21 23:08 UTC 版)

「第一種過誤と第二種過誤」の記事における「光学文字認識 (OCR)」の解説

一般に検出アルゴリズムは偽陽性に陥り易い。光学文字認識(OCR)ソフトウェアは "a" のように見えるドットの集まりを "a" であると認識してしまう可能性がある。

※この「光学文字認識 (OCR)」の解説は、「第一種過誤と第二種過誤」の解説の一部です。
「光学文字認識 (OCR)」を含む「第一種過誤と第二種過誤」の記事については、「第一種過誤と第二種過誤」の概要を参照ください。

ウィキペディア小見出し辞書の「光学文字認識」の項目はプログラムで機械的に意味や本文を生成しているため、不適切な項目が含まれていることもあります。ご了承くださいませ。お問い合わせ。

光学文字認識と同じ種類の言葉

>>同じ種類の言葉 >>分析に関連する言葉

>> 「光学文字認識」を含む用語の索引
光学文字認識のページへのリンク

光学文字認識とは？わかりやすく解説

OCR

光学文字認識

注釈

出典

光学文字認識 (OCR)

「光学文字認識」の関連用語


	Copyright © 2005-2024 Weblio 辞書 IT用語辞典バイナリさくいん。この記事は、IT用語辞典バイナリのOCRの記事を利用しております。
	All text is available under the terms of the GNU Free Documentation License. この記事は、ウィキペディアの光学文字認識 (改訂履歴)の記事を複製、再配布したものにあたり、GNU Free Documentation Licenseというライセンスの下で提供されています。 Weblio辞書に掲載されているウィキペディアの記事も、全てGNU Free Documentation Licenseの元に提供されております。
	Text is available under GNU Free Documentation License (GFDL). Weblio辞書に掲載されている「ウィキペディア小見出し辞書」の記事は、Wikipediaの第一種過誤と第二種過誤 (改訂履歴)の記事を複製、再配布したものにあたり、GNU Free Documentation Licenseというライセンスの下で提供されています。

光学文字認識とは？ わかりやすく解説

OCR

光学文字認識

注釈

出典

光学文字認識 (OCR)

「光学文字認識」の関連用語

光学文字認識とは？わかりやすく解説