|
|
Eastman's Online Genealogy Newsletter A Weekly Summary of Events and Topics of Interest to Online GenealogistsVol. 7 No. 28 – July 15, 2002 This newsletter was sponsored by Ancestry.com, To learn about Ancestry.com’s Past issues of this Newsletter are available at: This weekly newsletter is available in both HTML and ASCII text editions. Details are available at: http://www.RootsForum.com/newsletter Please feel free to forward copies of this newsletter to other genealogists. Copyright© 2002 by Richard W. Eastman. All rights reserved. If you contact any of the companies or societies mentioned in this newsletter, please tell them that you read about their services in this newsletter. - Soundex Explained Many genealogy records are indexed by a high-tech algorithm called the Soundex Code. Well, it was "high tech" in 1918 when it was invented by Robert Russell. In a nutshell, Soundex Codes provide a means of identifying words – especially names -- by the way they sound. They were used extensively by the WPA crews working in the 1930s to organize Federal Census data from 1880 to 1920. Soundex has also been used for many state and local census records and is very popular in genealogy software and databases. Motor vehicle bureaus in the District of Columbia, Maryland, Michigan, Minnesota, and Missouri employ Soundex for generating the initial characters of the identification numbers on driver's licenses. The Canadian Centre for Justice Statistics uses Soundex to encode names in its crime surveys and maintain the anonymity of individuals about whom data is collected. In the days when nearly all of the data for the Census of Population was collected by actual enumerators and individuals who walked from door to door, it was discovered that many of these people spelled surnames phonetically. Thus, one might spell Smith as "Smith" while another might spell it as "Smyth" and still another "Smythe." The census records were to be indexed by the sound of each name rather than by its spelling, and Soundex was the code system used to organize this index. If you search many records of interest to genealogists, sooner or later you will need to use Soundex Codes. Why? Well, you can often find a person’s entry by his or her Soundex Code even when the names have been misspelled. This becomes important when you realize that many census takers did not speak the language of the people being enumerated. In fact, in the first 150 years of U.S. census records, the majority of Americans were illiterate and did not know how to write their own last names. The spelling of many family names also has changed over the years, but often the Soundex Code remains the same. Spelling of names varies widely in early records, especially when language difficulties have intervened. For instance, I could not find my French-speaking great-grandparents listed in the U.S. Census. I searched and searched, but never found any entries for Joseph and Sophie Theriault. I then decided to do a Soundex search. The Soundex Code for Theriault is T643. When searching for Soundex Codes, I found several entries for T643 in Ashland, Maine, including one for the family of Joseph and Sophia Tahrihult -- improperly spelled, but with the same Soundex Code. The census taker had a Scottish name, and he was listed on another census page in the same town as a being born in Scotland. I am guessing that he did not speak French. I bet he had some difficulty when speaking with my great-grandparents, neither of whom spoke English and neither of whom could read or write. No wonder Theriault became Tahrihult! The Soundex Code is not difficult to learn although I still use a small reference card when I go to the archives to look at records. Every Soundex Code consists of a letter and three numbers, such as W-252. The letter is always the first letter of the surname, and the hyphen is optional. The numbers are assigned to the remaining letters of the surname according to the Soundex guide shown below. If necessary, zeroes are added at the end to produce a four-character code. Additional letters are disregarded. Here is the Soundex Coding Guide:
Here are some of the simpler examples:
Now let’s move on to some of the more complex rules:
While the rules sound a bit complex, they do become easier with a bit of practice. For those of us who are too lazy to go through the coding exercise, the computer age has brought many new tools. Most modern genealogy programs will tell you the Soundex Code of any name that you enter. In addition, a number of online Soundex Machines are available, including those at: http://resources.rootsweb.com/cgi-bin/soundexconverter, http://www.searchforancestors.com/soundex.html, http://www.geocities.com/Heartland/Hills/3916/soundex.html, http://www.pa-roots.com/soundex.html and http://www.genealogy.org/soundex.shtml. On any of these sites, you type in a last name, and then the site will display the correct Soundex Code. Yet Another Soundex Converter (YASC) at http://www.bradandkathy.com/genealogy/yasc.html will even convert a long list of names to their Soundex equivalents; you do not have to enter them one at a time. The National Archives and Records Administration publishes a free brochure, entitled Using the Census Soundex. To obtain a copy, send an e-mail to inquire@nara.gov and ask for General Information Leaflet 55, usually referred to as GIL 55, Make sure that you include your name, postal address, and "GIL 55 please". Anyone hosting genealogy pages on a UNIX or Linux Web server might want to know about a bash script called soundex.ss that is available at http://www.unixreview.com/documents/s=7458/uni1026336632258/0207e.htm. If you are familiar with bash, you can add a Soundex machine to your Web site. A similar program, written in C, is available at http://www.unixreview.com/documents/s=7458/uni1026336632258/0207e_C.htm. My thanks to Mark D. Aubrey for telling me about the soundex.ss bash script. While Soundex is a great tool and in widespread use, it certainly is not perfect. For example, it fails when the first letters are different. For instance, Knowles is coded as K542 while both Noles and Nolles are N420. Likewise, Cantor is C536 while the similar sound of Kantor is K536. Soundex also has a number of shortcomings when dealing with Eastern European Jewish names. Two Jewish genealogists, Randy Daitch and Gary Mokotoff, developed a more sophisticated system, more suitable for Jewish genealogy. The Daitch-Mokotoff Soundex is becoming the de facto standard for on-line lookups on Jewish-related web sites. You can read more about the The Daitch-Mokotoff Soundex in an article written by Gary Mokotoff at http://www.avotaynu.com/soundex.html. Numerous other improved Soundex methods have been developed in recent years and are in widespread use on numerous computer databases. However, these newer "improved Soundex" methods have never seen much use in genealogy databases. Now, have fun with census records! - World War I Draft Registration Cards Online The following is an announcement from MyFamily.com, the publishers of this newsletter:
- FGS Conference in California The U.S. Federation of Genealogical Societies will hold their annual conference in less than a month: August 7 through 10. This year’s event is hosted by the California State Genealogical Alliance and will be held at the new Ontario (California) Convention Center, which is directly adjacent to the Ontario International Airport, about 55 miles east of Los Angeles. Most U.S. airlines fly to that airport. The annual FGS conference usually is a good one to attend, and I suspect this year’s edition will be no exception. This year’s conference will feature more than 150 lectures presented by about 60 genealogy experts. The presentations range from topics for genealogy novices through some rather advanced research techniques. As you might imagine, for a conference sponsored by a consortium of genealogy societies, there are quite a few session topics dedicated to improving your local society’s operation and appeal. You can find a listing of all the planned presentations at: http://www.fgs.org/2002conf/conf-order.asp. The Federation of Genealogical Societies has extended the Early Registration Discount until 26 July 2002. You can register before that date, either for $86 for each day you plan to be there or for $141 for all four days. Prices will jump significantly after July 26. The conference has two conference hotels this year. The DoubleTree Hotel Ontario will serve as the hotel headquarters for the conference. You can view the accommodations there and make reservations at: http://www.doubletree.com/en/dt/hotels/index.jhtml?ctyhocn=ONTO-DT&bhref=http%3A%2F%2Fwww.google.com%2Fsearch%3Fhl%3Den%26ie%3DUTF-8%26oe%3DUTF-8%26q%3DDoubleTree%2BHotel%2BOntario%2B%26btnG%3DGoogle%2BSearch&bhjs=1&bhsw=1280&bhsh=1024&bhswi=981&bhshi=808&bhflver=5&bhdir=1&bhje=1&bhcold=24&bhrl=3&bhqt=1&bhmp=1&bhab=-1&bhmpex=6,4,9,1119&bhflex=6,0,29,0&bhdirex=&bhcont=lan. The Ontario Marriott is also being used as a conference hotel and is available at: http://www.marriotthotels.com/dpp/PropertyPage.asp?MarshaCode=ONTCA&checked=True&Marriott_BD={C63236F9-B176-4C70-9720-58E2E8F51079}. Both hotels are close to the Ontario Convention Center and to local restaurants. I am told that you won’t need a car if you stay at either of these two hotels. For more information regarding the conference, program and registration, go to: http://www.fgs.org/2002conf/FGS-2002.htm. I can offer one hint: while many airlines serve Ontario and nearby Los Angeles airports, Southwest Airlines is having an airfare sale right now. I made reservations this week from the East Coast to Los Angeles on Southwest Airlines for $99.00 each way. That’s not a bad price for a 3,000-mile flight! You must purchase tickets 21 days in advance, however. Looking at a calendar shows that you will need to make your reservations before July 17 in order to qualify for these low fares. You can check prices and make reservations at: http://www.southwest.com. - GENTECH and National Genealogical Society Merger Completed It is now official: GENTECH is a part of the U.S. National Genealogical Society. The GENTECH board and members approved the action on July 8, 2002. For all practical purposes, this concludes the merger process. All that remains is the technical procedure of filing the merger documents with the secretaries of state in the home states of GENTECH and NGS, which attorneys will do immediately. The merger agreement calls for NGS staff to take on the duties previously performed by GENTECH officers, a process that is already happening. I first reported on this planned merger last January (see http://www.ancestry.com/library/view/columns/eastman/5185.asp). The GENTECH Web site has more details, and the National Genealogical Society’s Web site also has a small mention of the merger at: http://www.ngsgenealogy.org/gentech.htm. (The NGS site simply says to check the GENTECH site for details.) It is sad to see GENTECH lose its unique status and perhaps its identity. However, the organization completed its primary goals. About ten years ago GENTECH was formed by a small group of visionaries in the Dallas, Texas area. The organization was chartered to help bridge genealogy and technology. Ten years later, that mission has been accomplished. Partly due to the efforts of GENTECH and also because of the pervasiveness of modern technology into almost every corner of our everyday lives, genealogists now embrace technology in a manner barely dreamed of ten short years ago. I well remember attending the first GENTECH conference in 1992. This was before many people had heard of the World Wide Web. Several speakers, myself included, predicted that someday there would be online genealogy databases that we all could access while seated at home. Some skeptical attendees remarked, "Not in my lifetime!" Ten years ago the term "computer genealogist" was seen as an elite status by some and as a derogatory term by others. In the year 2002, the term has almost disappeared simply because it no longer signifies anything unique. Almost all of today’s genealogists are "computer genealogists." The founders of GENTECH and the many others who helped over the years are to be commended for a job that was very well done. GENTECH did bridge genealogy and technology. They also put on some dynamite conferences. The GENTECH Genealogical Data Model and the Lexicon Working Group received less publicity but, in fact, created standards that will benefit genealogists for many years to come. For the foreseeable future, the GENTECH web site will continue to be a source of information related to the merger transition, the GENTECH conferences, the scholarship program, and related news. Keep an eye on http://www.gentech.org. The GENTECH2003 conference is scheduled for January 17 and 18 and promises to be a bigger and better affair than any of the previous GENTECH conferences. About 1,100 people attended the last GENTECH conference, and there is an excellent chance that the 2003 edition in Phoenix will be even larger. I know that I wouldn’t miss it. The GENTECH2003 conference is co-sponsored by the Arizona Genealogical Computer Interest Group (AGCIG). Keep an eye on the AGCIG Web site at http://www.agcig.org/gt03/index.htm for details. If you have any questions about the impact of the NGS/GENTECH merger, send email to info@gentech.org. - Update on International Roots Conference The International Roots Conference in Dearborn, Michigan was scheduled to start at about the same time this edition of the newsletter is being distributed. You probably have heard by now that the conference organizers canceled the event more than three weeks ago and have refused to refund money to those who pre-registered. If you are one of the many people who sent money to the International Roots Conference for pre-registration, you should be reading the messages at the IRC-Victims mailing list at: http://groups.yahoo.com/group/irc-victims/join. Despite claims by the conference organizers that no refunds would be made, several people on that mailing list report they have successfully obtained refunds from their credit card companies. Apparently some credit card companies are paying the refunds to their customers without question because of the fraudulent charges involved. The assumption is that the credit card companies will seek reimbursement from the conference organizers. There is no way of knowing if the credit card companies will ever get their money from the organizers or not, but that is no longer a concern for the customers who receive permanent refunds. In other news, a number of genealogists have reported receiving conference advertising two weeks or more after the conference organizers canceled the conference. These newly delivered postcards invite the recipients to register in advance and to sign up for the (extra-cost) luncheons and workshops. Not only were these mailed some time after the cancellation announcement, but the amazing thing is that these post cards also invite the recipients to send checks or credit card information to the conference organizers! I wonder if some people did send money or credit card information at this late date. I suspect that not everyone has heard about the conference cancellation and the statement of "no refunds." I also wonder how many people will show up at the conference hotel on Sunday afternoon because they haven’t heard about the cancellation. - Update on 1901 U.K. Census Fiasco On January 1, 2002, the U.K. Public Record Office (PRO) placed the 1901 Census Records online amongst great hoopla and celebration. However, the system crashed again and again. In fact, it was operational only a few minutes in between crashes. After a few days, the PRO "pulled the plug" and announced that they were ordering more servers. The PRO’s announcement stated, "The site remains unable to meet continuing levels of demand. The PRO has agreed with QinetiQ's technical team that searching of the database and downloading of images will not be available for one week to allow enhancements to take place." QinetiQ (pronounced "kinetic") is the government owned company that did the technical work. For background information, read my articles at http://www.ancestry.com/library/view/columns/eastman/5123.asp. and at http://www.ancestry.com/library/view/columns/eastman/5619.asp. In the short time it was operational, genealogists found many problems with the online census records. Besides the issues of system availability, there are questions about the indexes, which reportedly are highly inaccurate. There are further questions about the sale of vouchers that remain worthless. The "one week to allow enhancements to take place" stretched into many weeks and even months with no new announcements, other than "we are working on it." British genealogists have demanded an explanation and have contacted several Members of Parliament. Accusations are flying. Now, six and a half months after the site’s original debut, the Public Record Office has made a brief announcement:
In other words, they are still "working on it." - National Archives Proposes Change in Availability of Records The following announcement is from the U.S. National Archives and Records Administration:
- Massachusetts Public Records Closing Alert Access to public records threatens to be curtailed in many places. Massachusetts is the latest state to consider closing access to records that are vital to genealogists. The following was written by Sharon Sergeant of the Massachusetts Genealogical Council: Festa of Melrose, Nangle of Lowell, Rivera of Springfield, Bradley of Hingham, Brown of Wrentham and Hill of Ipswich. http://www.state.ma.us/legis/comm/j17.htm The Massachusetts Genealogical Council welcomes all questions from interested researchers, as well as shared experiences from other jurisdictions dealing with ill-conceived, reactionary measures to identity fraud, privacy or records preservation and access issues. Please send your questions and suggestions to info@massgencouncil.org. If you are among the millions who claim early ancestors in Massachusetts - prior to the proposed 1910/1950 exclusions - and do not believe you will be impacted by such a law, please consult with your contemporary cousins and resident collateral lines. Your family will be affected! Thank you for your interest and support in this matter. Sharon Sergeant Program Director, - New Jersey Initiates Open Public Records Act Just at a time when various states are closing access to their records, New Jersey is moving in the opposite direction. New Jersey has wisely passed into law an act for the greater opening of public records in the state and plans to do more for current state records now considered closed to the public. The Government Records Council (GRC) has published a Public Guide to Open Public Records Act (OPRA) on their website at http://www.nj.gov/grc. The link for the Public Guide to OPRA is http://www.nj.gov/grc/publicflyer.pdf. The law went into effect on Monday, July 8 and is relatively progressive. Although vital records in the state do not fall under this act (and neither do other select records for privacy concerns such as juvenile records and criminal investigations), it certainly is a step in the right direction for the state's genealogists.The following FAQs (frequently-asked questions) are from the GRC web site:
My thanks to Chad Leinaweaver, Director for the Library at the New Jersey Historical Society, for telling me about the new Open Public Records Act. - FBI Hires LDS Church Data Chief In the March 27, 2002 edition of this newsletter, I wrote about the FBI efforts to improve their databases used to track criminals. The agency consulted with some of the LDS Church computer experts who manage the huge genealogy databases in Salt Lake City. You can read my article at: http://www.ancestry.com/library/view/columns/eastman/5471.asp. One thing that amuses me is that several people wrote to me after that article was published, claiming that the article was not true. They said that that the FBI was not talking with the LDS computer experts and that the original articles on the wire services were unfounded. Well, somebody at the FBI obviously talked to at least one senior official of the LDS Church’s computer staff. This week the FBI announced that Darwin A. John, formerly managing director of Information and Communications Systems at the LDS Church, has been appointed to become the FBI's Chief Information Officer. Since 1990, Darwin A. John set strategic direction for use of computer and communications technology worldwide for the Church of Jesus Christ of Latter-day Saints. He also helped lead creation of the church's very popular genealogy Family Search Web site at http://www.FamilySearch.org. Since appearing in 1999, that site has averaged 7 million to 8 million hits a day from people searching the 900 million names in the system. FBI Director Robert Mueller said John brings a "demonstrated capability to achieve broad-based results in an area critical to the FBI's success, particularly at a time when the bureau is modernizing its information technologies while reorganizing and re-engineering and undergoing unprecedented change in its investigative mission and priorities." Stephen Finnerty, president of the Society for Information Management, described John as "one of the most progressive leaders in the information systems industry," and said he expects him to make "valuable and outstanding contributions to the FBI, its mission, and the American people." You can read the full story at a number of news sites, including: http://deseretnews.com/dn/view/1,1249,405017185,00.html - Super Bowl Hero’s Family Tree Football fans – especially those in the northeastern U.S. – will long remember Adam Vinatieri as the field goal kicker who won the 2002 Super Bowl in overtime for the New England Patriots. Now an article in USA Today claims that Vinatieri is a distant cousin of daredevil Evel Knievel and also is the great-great-grandson of General George A. Custer’s bandleader. Luckily for Adam, the band didn’t accompany Custer to Little Big Horn. His great-great-grandfather survived and raised a family. It’s an interesting story, but no references to source documentation are offered. You can read about Adam Vinatieri at: http://www.usatoday.com/sports/ccovthu.htm - Introducing a New Section: New Books I am going to experiment with a new section in this newsletter. "New Books" will periodically announce new books of interest to genealogists. Each book mentioned in this new section will be one that is newly published or perhaps is a significant new update of a book published some years ago. Each listing will probably contain the book title, a very brief description of the book (typically one to three sentences), followed by the author’s name, the publisher’s name (optional) and then a Web address or e-mail address where you may obtain more information or even purchase the new book. I hope to list books that are self-published by individuals or smaller societies as well as books from the major genealogy publishers. The books do not have to be "pure genealogy" books; I will also list county histories, atlases and more. I will, however, limit this section to books published on paper. The new section is not for listing books on CD-ROM or online, as I have always written reviews of high-tech genealogy publications elsewhere in this newsletter. I hope to continue reviewing high-tech publications separately. Please let me know your opinion of the new section, and also please feel free to offer suggestions on how to improve it. Here are the first book announcements of this new section:
A note to authors and publishers: If you would like to have your new book(s) listed in future newsletters, send a brief descriptive note to newbooks@rootsforum.com. You do not need to send a copy of your book; an announcement will suffice. Please make sure that you include a Web address or an e-mail address where potential buyers can obtain more information. Are you interested in the articles in this newsletter? Would you like to learn more or ask questions or make comments about these articles? Join this newsletter’s online discussion group on CompuServe’s Genealogy Techniques Forum. The CompuServe forums are free and are available to anyone using Netscape, Internet Explorer or CompuServe’s own software Go to: http://www.rootsforum.com. If you would like to submit news, information or press releases for possible inclusion in future newsletters, send them to richard@eastman.net. The author does reserve the right to accept or reject any articles submitted. DISCLAIMER: This newsletter is being written and sent via e-mail at no charge. I expect to write one new issue on a more or less weekly basis. However, life sometimes interferes, and the need to earn a living may create an occasional delay. COPYRIGHTS: The contents of this newsletter are copyright by Richard W. Eastman with the following exception:
You are hereby granted rights, unless otherwise specified, to re-distribute articles from this newsletter to other parties provided:
Also, please include the following statement with any articles you re-distribute:
Thank you for your cooperation. About the author: Dick Eastman is a frequent presenter at major genealogy conferences. He has published articles in Genealogical Computing and Family Chronicle magazines and for a number of Web sites. He was an advisor to PBS' Ancestry series and appeared as a guest in one of the episodes. He serves on the Advisory Board of the New England Historic Genealogical Society and is a past Director of GENTECH and the New England Computer Genealogists. Dick is the author of YOUR ROOTS: Total Genealogy Planning On Your Computer published by Ziff-Davis Press. He also manages three Genealogy Forums on CompuServe. He can be reached at: richard@eastman.net. Due to the volume of e-mail received, he is unable to answer every e-mail message received. Subscription information: This newsletter is sent to you free of charge. Please feel free to copy this subscription information and pass it on to anyone else who you think might be interested in obtaining a free subscription. The method of subscribing and unsubscribing changed recently. The easiest method to subscribe or unsubscribe is via the Web. Go to: http://www.rootsforum.com | |||||||||||||||||||||