Digitization of Modulus (student yearbook) Now Complete Through 1980

February 4, 2010

Rose-Hulman Modulus Yearbooks

The digitization of the Modulus, the student yearbook for Rose-Hulman Institute of Technology, is moving along at a swift pace thanks to our diligent student worker. Elizabeth is producing high-quality scans very quickly.

We have scanned and put online all yearbooks through 1980. The 1982, 1985 and 1987 yearbooks are already online as well. 1981 and 1983 are scanned, and 1984 is in the works. We hope to have the 1980s finished by the end of the month and the remaining 10 completed by the end of the school year.

We are able to work much faster than in the past because we now make only one scan of each page, a high-resolution TIFF, and the OCR feature in CONTENTdm does a decent job of importing the text from it.

http://www.rose-hulman.edu/Archives/modulus.html


Editing OCR Transcript Field in CONTENTdm

October 22, 2009

Are you a user of CONTENTdm 5.x? Have you fooled around with the OCR feature for TIFF images? If you have not, or if you have and found it frustrating, here are some tips. First, it works best on text set in basic, easy-to-read fonts; the larger the better. As with most OCR software, the smaller the font, the more likely there will be mistakes in the OCR text. The same goes for fancy fonts. We scanned a yearbook from 1901 that used a font similar to Old English, and we had to make corrections on almost every line. But even with good, clear text there are bound to be issues. Sometimes images are interpreted as text, and a string of strange characters is entered into the transcript field.
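If you want a rough way to spot those strings of strange characters before proofreading, something like the sketch below might help. It is not part of CONTENTdm and does not use any CONTENTdm feature; it assumes you have copied the transcript text out as plain text. The function name `suspicious_lines` and the 0.6 threshold are my own illustrative choices, so adjust them to your material.

```python
def suspicious_lines(transcript, min_expected_ratio=0.6):
    """Return (line_number, text) pairs for lines that look like OCR noise.

    A line is flagged when fewer than `min_expected_ratio` of its characters
    are letters, digits, or spaces. The threshold is an assumption, not a
    tested value.
    """
    flagged = []
    for number, line in enumerate(transcript.splitlines(), start=1):
        stripped = line.strip()
        if not stripped:
            continue  # skip blank lines
        # Letters, digits, and spaces count as "expected" characters.
        expected = sum(ch.isalnum() or ch.isspace() for ch in stripped)
        if expected / len(stripped) < min_expected_ratio:
            flagged.append((number, stripped))
    return flagged


if __name__ == "__main__":
    sample = "Class of 1901\n~~|#^ \\/ }{ *&%\nRose Polytechnic Institute"
    for number, text in suspicious_lines(sample):
        print(f"line {number}: {text}")
```

This only catches lines that are mostly symbols. It will not catch misread letters inside otherwise normal words, so the pages still need a human proofread.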

Here is the BIG TIP!! If you are building a compound object of many pages, such as a yearbook, and using the OCR feature, edit the transcript field for each page while still in the CONTENTdm Project Client, BEFORE uploading the object and its files. This is much faster than editing the transcript fields once the object has been uploaded. I found this out the hard way. I uploaded about 5 yearbooks and then had my students find and correct each page in the web administrator module. This is very inefficient, because for every page you have to search for it, open it, edit it, and then close it. A much quicker way is to do the editing right in the CONTENTdm Project Client, after you have built the object but before you upload it. There you can edit one page right after another.

