We are considering using AI to capture data from images, in order to complete records and reduce errors during field data collection. The focus is on unstructured or open-text variables such as First Name, Last Name, Address, and Date of Birth taken from an identification document, for example a driver's license:
First Name: IMA
Last Name: CARDHOLDER
Date of Birth: 08/31/1977
Address: 2570 24TH STREET ANYTOWN, CA 95818
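For illustration only, here is a minimal Python sketch of the step that would come after text recognition: turning "Label: value" lines like the ones above into a record. The field names in `FIELDS`, the helper `parse_id_text`, and the MM/DD/YYYY date format are assumptions made for this example. Real licenses rarely print such labels, so actual OCR output would need more work than this.

```python
import re
from datetime import datetime

# Hypothetical label-to-field mapping; the labels mirror the sample above.
FIELDS = {
    "First Name": "first_name",
    "Last Name": "last_name",
    "Date of Birth": "date_of_birth",
    "Address": "address",
}


def parse_id_text(text: str) -> dict:
    """Turn 'Label: value' lines into a dict, skipping unknown labels."""
    record = {}
    for line in text.splitlines():
        label, sep, value = line.partition(":")
        key = FIELDS.get(label.strip())
        if sep and key:
            # Collapse runs of whitespace that OCR often introduces.
            record[key] = re.sub(r"\s+", " ", value).strip()
    # Assumes MM/DD/YYYY; a misread date raises ValueError instead of being stored.
    if "date_of_birth" in record:
        dob = datetime.strptime(record["date_of_birth"], "%m/%d/%Y")
        record["date_of_birth"] = dob.date().isoformat()
    return record


sample = """First Name: IMA
Last Name: CARDHOLDER
Date of Birth: 08/31/1977
Address: 2570 24TH STREET ANYTOWN, CA 95818"""

record = parse_id_text(sample)
print(record)
```

Checking the date at this stage means a bad read can be sent back to the interviewer for correction rather than saved silently.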
Is the use of AI possible in CAPI Text Data Collection from Image?
-
vgonzalez
- Posts: 33
- Joined: March 12th, 2020, 4:19 pm
- Location: VENEZUELA
Is the use of AI possible in CAPI Text Data Collection from Image?
-
Gregory Martin
- Posts: 1946
- Joined: December 5th, 2011, 11:27 pm
- Location: Washington, DC
Re: Is the use of AI possible in CAPI Text Data Collection from Image?
We haven't looked into adding anything like this to CSPro, but it could be possible to add a wrapper around something like Tesseract so that text could be extracted from images:
https://github.com/tesseract-ocr/tesseract
What is your use case?
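As a rough idea of what such a wrapper could look like outside CSPro, here is a minimal Python sketch. It assumes the third-party `pytesseract` and `Pillow` packages plus an installed Tesseract binary. The names `tesseract_ocr`, `extract_lines`, `fake_ocr`, and the file name `license.jpg` are hypothetical, and how the result would be passed back into a CSPro application is not covered here.

```python
from typing import Callable, List, Optional


def tesseract_ocr(image_path: str) -> str:
    """Run Tesseract through pytesseract (third-party; needs the tesseract binary)."""
    # Imported lazily so the rest of the sketch runs without these packages.
    import pytesseract
    from PIL import Image

    return pytesseract.image_to_string(Image.open(image_path))


def extract_lines(
    image_path: str, ocr: Optional[Callable[[str], str]] = None
) -> List[str]:
    """Return the non-empty, whitespace-trimmed lines recognised in an image."""
    engine = ocr or tesseract_ocr
    raw = engine(image_path)
    return [line.strip() for line in raw.splitlines() if line.strip()]


# Stand-in engine so the wrapper can be tried without an image or Tesseract.
def fake_ocr(_path: str) -> str:
    return "IMA\n\n  CARDHOLDER \n08/31/1977\n"


lines = extract_lines("license.jpg", ocr=fake_ocr)
print(lines)
```

Passing the engine in as a parameter keeps the wrapper independent of Tesseract, so another OCR library could be swapped in later without changing the calling code.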
-
htuser
- Posts: 686
- Joined: December 19th, 2011, 6:26 pm
- Location: Silver Spring Area, MD, USA
Re: Is the use of AI possible in CAPI Text Data Collection from Image?
@vgonzalez
Ah Ah... I worked on the same issue nearly four years ago. Please take a look at my post here: viewtopic.php?p=14388
You can also read: viewtopic.php?p=18675
More recently, on the web, from the JS ecosystem:
a) A recent JS-compatible use case can be read here: https://transloadit.com/devtips/integra ... seract-js/
b) Working example: https://github.com/simonw/tools/blob/main/ocr.html
c) Tesseract alternative: https://github.com/scribeocr/scribe.js/tree/master and comparisons: https://github.com/scribeocr/scribe.js/ ... sseract.md
However, Action Invoker support for Worker Threads from the CSPro developer team is very important for such operations in CSPro-powered applications.
Josh is implementing a very innovative feature that will boost CSPro users' productivity using similar and more advanced ML models: https://github.com/jhandley/survaize
With the complete bidirectional JS-CSPro logic API planned for the future CSPro 8.1, such things will be more portable to the CSPro ecosystem.
I'm waiting for the alpha or beta version to do some testing.
Best
G.VOLNY, a CSProuser from Haiti, since 2004
-
vgonzalez
- Posts: 33
- Joined: March 12th, 2020, 4:19 pm
- Location: VENEZUELA
Re: Is the use of AI possible in CAPI Text Data Collection from Image?
Thank you both very much. I'll take a look at the previous posts and try starting a project from scratch to see how it goes.