Document Imaging Report published “Parascript Aims Classification at E-Governance” by DIR Editor and Publisher Ralph Gammon on June 26, 2015. A brief excerpt is provided here – Auto-classification is one of the hottest buzzwords in the document capture market. Every ISV these days has an auto-classification product that can be used to identify similar groups of documents. But, what is the killer app? We’ve seen some success in areas like identifying document types within large mortgage files, but it doesn’t seem like we’ve found that killer app yet. Parascript, the Boulder, CO-based recognition technology ISV, thinks it may be information governance.
“There is no question that auto-classification is an over-used term,” said Greg Council, VP of marketing and product management at Parascript. “The use cases we’ve seen have been basically confined to identifying document types within a pre-defined package of documents. Typically, users know what they are looking for.
“We started down the road of enhancing our document classification technology to address that type of need. We have a software partner in France working on a contract conformity application. They basically have to ensure that a package of documents is complete so it can be considered a valid contract. This involves solving three primary problems: establishing that the right documents are there, identifying document boundaries (first and last pages—basically, document separation), and running rules to look for specific data.
“I’ll admit that we were kind of weak with our auto-classification previously, but when we started addressing that application, we saw the opportunity to go beyond this type of use case. The broader market we think is information governance. Within that area, we fixated on two primary business problems.
“The first is the ability to control and manage documents within a records management (RM) system. Most current RM applications require that end users, people like subject matter experts or even file clerks and records managers, tag documents. However, in today’s IT environment there are so many storage options that users will often bypass their RM requirements. This creates a real problem as organizations don’t even understand what they have and therefore can’t control it. Basically, if it’s not tagged, it’s not recognized by an RM application.
“The second problem is more closely related to ECM, and that is findability. This is related to not having a good taxonomy around documents and ensuring that they are all defined the same way. Everybody has unstructured search engines in their ECM applications, but they still have trouble finding documents.
“Let’s take a credit union that we’ve been talking to. The loan officers and CSRs are having a real problem locating the documents they need to service customers during interactions. When this organization adopted its current document management system, all its documents were merged into it, but they are only classified by account numbers. So, if a customer wants information related a specific document, the CSR has to page through their entire file.
“They also have a warehouse of documents, and they don’t know what to keep as they transition to a new document management system. For example, they don’t know exactly which documents have value due to their being associated with existing accounts. They would also like to eliminate any duplicates.
“They are looking at employing six staff members to scan and visually look at each document to apply meta data. With auto-classification technology, if you set up proper rules, we think this should be able to be accomplished by a single person.” ….
For the full article, “Parascript Aims Classification at E-Governance” go to the Document Imaging Report website.