Archive:Phoenix Transcription
The Phoenix Project Transcription Process
Transcribing a page of Phoenix's book is a rather complicated, multi-step process. The way I do it is described below, using the transcription of page 2470 as an example.
Step 1: Scan the page to be done. Resulting image.
Step 2: Run optical character recognition (OCR) software, Textbridge Plus, version 1.0C. Resulting file. NOTE: I no longer have a working version of this software.--RLW
Step 3: Train the OCR software by correcting dubious words.
| Line | Original | Correction | Comments |
|---|---|---|---|
| 1 | 2470 | 2470 | OK |
| 1 | Generahon. | Generation. | typical error |
| 2 | (NICk•ion) | (Nickerson) | trouble with bold face |
| 2 | wiaro. | Wiard. | upper vs. lower case |
| 2 | '9,45 | 19145 | numerical problem |
| 3 | 20311 | 20311 | OK |
| 3 | Je~I. | Jessie | trouble with bold face |
| 3 | N.Y., | N. Y., | spacing problem |
| 3 | r8yo. | 1870. | numerical problem |
| 4 | Rusmil | Russell | trouble with bold face |
| 4 | (cilbert) | (Gilbert) | trouble with bold face |
| 4 | 19163 | 19163 | OK |
| 5 | Olin, | Olin, | OK |
| 5 | Clinton, | Clinton, | OK |
| 5 | N.Y., | N. Y., | spacing problem |
| 5 | Oct. | Oct. | OK |
| 5 | Co., | Co., | OK |
| 5 | £8~. | 1869 | numerical problem |
| 6 | Clinton, | Clinton, | OK |
| 6 | N.Y., | N. Y., | spacing problem |
| 6 | r8yi. | 1871. | numerical problem |
| 7 | N. | N. | OK |
| 7 | £4 | 14 | numerical problem |
| 7 | £872. | 1872. | numerical problem |
| 8 | 20315 | 20315 | OK |
| 8 | il-leph | Joseph | trouble with bold face |
| 8 | N.Y., | N. Y., | spacing problem |
| 9 | Child | Child | OK |
| 9 | Annl~ | Annie | trouble with bold face |
| 9 | Iaui~ | Louise | trouble with bold face |
| 9 | oilisrt | Gilbert | trouble with bold face |
| 9 | iq'66 | 19166 | numerical problem |
| 10 | 20316 | 20316 | OK |
| 10 | '£4 | 114 | numerical problem |
| 10 | N.Y., | N. Y., | spacing problem |
| 11 | '874. | 1874. | numerical problem |
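
Several of the numerical and spacing problems in the table above follow a regular pattern, so they could in principle be caught by a small script before manual review. The following is only a sketch, not part of the process described here: the look-alike map is inferred from the sample rows, and the function names `fix_numeric_token` and `fix_state_spacing` are hypothetical. Tokens it does not recognize (for example `£8~.` or `'9,45`) are left unchanged for a human to correct.

```python
import re

# Assumed look-alike map, inferred only from the sample rows for page 2470.
DIGIT_LOOKALIKES = str.maketrans(
    {"r": "1", "£": "1", "'": "1", "i": "1", "y": "7", "o": "0", "q": "9"}
)

# A token made of digits and look-alike characters, with optional trailing "." or ",".
NUMERIC_TOKEN = re.compile(r"^([r£'iyoq0-9]+)([.,]?)$")


def fix_numeric_token(token):
    """Replace look-alike characters in a token that is probably a number."""
    match = NUMERIC_TOKEN.match(token)
    # Require at least one real digit so ordinary words such as "or" are left alone.
    if match is None or not any(ch.isdigit() for ch in match.group(1)):
        return token
    return match.group(1).translate(DIGIT_LOOKALIKES) + match.group(2)


def fix_state_spacing(token):
    """Insert the space the book uses in two-letter abbreviations like 'N. Y.'."""
    return re.sub(r"\b([A-Z])\.([A-Z])\.", r"\1. \2.", token)


for raw in ["r8yo.", "£872.", "iq'66", "Olin,", "£8~."]:
    print(raw, "->", fix_numeric_token(raw))
print("N.Y.,", "->", fix_state_spacing("N.Y.,"))
```

Requiring a genuine digit in the token is a deliberate, conservative choice: it avoids rewriting short words that happen to consist of look-alike letters, at the cost of missing badly damaged numbers.
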
Step 4: Rearrange blocks of text and remove extraneous space. Resulting file.
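
Removing extraneous space is the part of this step that lends itself to automation; rearranging blocks still needs a person looking at the page. As a minimal sketch, assuming the OCR output is plain text, the hypothetical helper `squeeze_whitespace` below collapses runs of spaces and tabs and reduces repeated blank lines to one.

```python
import re


def squeeze_whitespace(text):
    """Collapse runs of spaces/tabs and repeated blank lines in OCR output."""
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.splitlines()]
    cleaned = []
    for line in lines:
        # Keep a blank line only if it follows a non-blank line.
        if line or (cleaned and cleaned[-1]):
            cleaned.append(line)
    # Drop a trailing blank line, if any.
    while cleaned and not cleaned[-1]:
        cleaned.pop()
    return "\n".join(cleaned)


print(squeeze_whitespace("  20311   Jessie \n\n\n  b.  Clinton,\t N. Y., \n\n"))
```
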
Step 5: Run spell checker and correct errors.
| Line | Original | Correction | Comment |
|---|---|---|---|
| 2 | Nickerson | -- | proper name |
| 3 | Wiard | -- | proper name |
| 3 | Patterson | -- | proper name |
| 4 | Chil | -- | abbreviation |
| 4 | tucy | Lucy | |
| 4 | Olin | -- | proper name |
| 5 | CIib.rt | Gilbert | bold face problem |
| 9 | H.rman | Herman | bold face problem |
| 9 | Olibert | Gilbert | bold face problem |
| 10 | Cilbert | Gilbert | bold face problem |
| 10 | Balehen | Balchen | |
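
A general-purpose spell checker flags every proper name, as the table above shows. One way to reduce that noise, sketched here under stated assumptions, is to compare flagged words against a list of names already known to occur in the book. The list `KNOWN_NAMES` and the function `suggest_name` are hypothetical, and the names are taken only from the sample rows above. The matching uses `difflib.get_close_matches` from the Python standard library.

```python
import difflib

# Assumed name list, built only from the sample corrections for page 2470.
KNOWN_NAMES = [
    "Nickerson", "Wiard", "Patterson", "Olin",
    "Lucy", "Gilbert", "Herman", "Balchen",
]


def suggest_name(word, names=KNOWN_NAMES, cutoff=0.6):
    """Return the word itself if it is a known name, else the closest known name.

    Returns None when nothing is similar enough, so the word still goes
    to a human proofreader.
    """
    if word in names:
        return word
    matches = difflib.get_close_matches(word, names, n=1, cutoff=cutoff)
    return matches[0] if matches else None


for flagged in ["Nickerson", "Cilbert", "H.rman", "Balehen", "tucy"]:
    print(flagged, "->", suggest_name(flagged))
```

A suggestion from this kind of matching is only a candidate: badly damaged words may fall below the cutoff, and a close match can still be the wrong name, so the proofreading step remains necessary.
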
Step 6: Proofreading.
| Line | Original | Correction | Comment |
|---|---|---|---|
| 2 | Jan. | Jane | bold face problem |
| 4 | or | of | |
| 7 | In. | III. | |
Step 7: Insert text blocks and page links into HTML template. Resulting file.
Step 8: Insert HTML header and footer templates. Resulting file.
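
Steps 7 and 8 amount to filling fixed templates with the page's text blocks and its links to neighboring pages. The actual templates are not reproduced here, so the sketch below invents its own: the template strings, the `page<N>.html` link naming, and the function `build_page` are all assumptions for illustration. It uses `string.Template` and `html.escape` from the Python standard library.

```python
from html import escape
from string import Template

# Hypothetical templates; the real header, body, and footer templates are not shown.
HEADER = Template("<html><head><title>$title</title></head><body>\n")
BODY = Template(
    '<p><a href="$prev">Previous page</a> | <a href="$next">Next page</a></p>\n'
    "$blocks\n"
)
FOOTER = "</body></html>\n"


def build_page(page, blocks):
    """Assemble one transcribed page from its text blocks."""
    # Escape each block so characters like "&" are valid in HTML.
    paragraphs = "\n".join(f"<p>{escape(block)}</p>" for block in blocks)
    body = BODY.substitute(
        prev=f"page{page - 1}.html",  # assumed file naming scheme
        next=f"page{page + 1}.html",
        blocks=paragraphs,
    )
    title = f"The Whitney Family of Connecticut, page {page}"
    return HEADER.substitute(title=title) + body + FOOTER


print(build_page(2470, ["2470 Tenth Generation.", "Child of Russell & Lucy"]))
```
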
Step 9: Copy the transcribed page and paste it in the edit window for [[Archive:The Whitney Family of Connecticut, page 2470]], preview it, and save it. Resulting page.
Step 10: Edit the Phoenix Project index page to reflect the newly added pages.
Copyright © 2005, 2006, Robert L. Ward and the Whitney Research Group