Request for Question Clarification by
pafalafa-ga
on
29 Jul 2004 19:05 PDT
Hello clay_shirky-ga,
I've been mulling over this question for a while because it's an
interesting challenge. So far, though, I can't really get my hands
around a way to approach it.
In theory, one of the businesses like a phone company or large mailing
house that has access to super-large databases of names in the US
(some of them have over 200 million records!) could run a frequency
distribution and produce an answer. I doubt any of them have the
incentive to do so, however.
But...your friendly datacrunchers at the Census Bureau once created
exactly this sort of frequency distribution for last names. This
list, which is here:
http://www.census.gov/genealogy/names/dist.all.last
at least sheds a little light on the situation.
The first name on this list -- no surprise -- is Smith, the most
common surname in America, shared by 1.006% of all Americans (at
least, at the time of the Census on which this was based). Johnson,
Williams, Jones, Brown, Davis Miller, Wilson are next in line. The 8
surnames cumulatively account for 5% of all Americans.
If you think about, there are going to be very, very few unique names
included at the top of the surname list. Sure, there's probably a
Cougat Johnson out there, or a ESPN Smith, and some other unique
names, but these are the rare exceptions.
In fact, I'd say for the top 50% of names -- where you're still
dealing with pretty familiar names, like Siegel, Clinton, Kraft,
Kauffman, etc -- there's still likely to be only a tiny percentage of
unique names here.
Way down at the bottom of the list -- where you get the Aalund's and
Aalderlink's -- the reverse is likely true. I would guess that a very
high proportion of these names are unique names. There might be five
or ten John Aalund's floating around, but I don't think you'll find
too many Clay Aalund's.
By the way, Shirkey (with an "e") is number 24,219 on the list.
The 88,799 names on the Census list account for about 90% of the
population. That means the remaining 10% have surnames even rarer
than Aalund and Aalderlink!
I would guess, based on this, that maybe 2-5% of Americans have a
totally unique name...a higher number than I would have thought.
I'd be interested to hear what you or my fellow researchers think
about this guesstimate.
Cheers.
pafalafa-ga