Articles by: sroth

Finding Similar Documents Without a Full Text Index

Finding Similar Documents Without a Full Text Index

Is there a way to quickly find similar documents in a Documentum repository? Yes, there is. One approach could be to use the Lucene MoreLikeThis() API. This API call to the Lucene Full Text search engine extracts what it believes to be the most salient words from a...

read more
How to Export Tabluar Data in Captiva 7

How to Export Tabluar Data in Captiva 7

Armedia has a customer using Captiva 7 to automatically capture tabular information from scanned documents. They wanted to export the tabular data to a CSV file to be analyzed in Excel. Capturing the tabular data in Captiva Desktop proved to be simple enough, the...

read more

Art and Computer Science

I picked up a book in Armedia's technical library by accident the other day, but have come to really appreciate the rewards of that serendipitous event. I first grabbed the book because of its author, Don Knuth, is a well-known innovator in the computer science world....

read more

A New Kind of Business Philosophy?

Over the Christmas holiday, a colleague gave me The Go-Giver, by Bob Burg and John David Mann. It’s a charming little parable with huge lessons. While reading the authors’ five laws for success (stratospheric success!), I thought to myself: these are principals I...

read more