RETURN TO CONTENTS

MOTHER TONGUE 1 - INTRODUCTION AND OVERVIEW AUGUST 2008

W.G. Davey: All Rights Reserved

wgdavey@zianet.com

INTRODUCTORY REVIEW, DATA COLLECTION AND ORDERING NAMES, AND ANALYSIS:

INTRODUCTORY REVIEW:

Our primary finding is that languages across the World that are remote geographically, racially, historically, technologically, and linguistically have similar names for parts of the body. Thus they must be derived from a single older language, the "Mother Tongue". Our second important finding is that the languages of the Americas are strongly linked with all the languages of the Old World. In this sense the Americas are not remote lands but one of the most important factors of world languages. These are the major conclusions from a study of the names for one hundred parts of the body or body secretions in a massive database of over 750 languages across the World. The names of bodyparts were chosen because of the simple fact that all peoples have the same bodies wherever and however they live.

A point to emphasize is that the similarity in names is a simple and obvious fact, not a matter of speculation. The writer is not a linguist and the reader does not need to have the specialized knowledge of linguists. In brief, if names sound the same then they probably are the same or "If it looks like a duck, walks like a duck and sounds like a duck then it is probably a duck".

We anticipate that there will be many skeptics regarding our claim for worldwide similar names and so we immediately present two of the simplest and clearest examples of the thousands of lists of links that we have discovered.

The first example shows the remarkable spread of names for Breast of the form "T-T-". For those unfamiliar with Greenberg’s analysis of American languages, Na-Dene, Almosan-Keresiouan, Macro-Carib, Macro-Panoan, and Andean are all American language groups. This Table shows the possible variations in spelling and also shows the widespread name of TITI that may thus be the oldest form of the name.

BREAST T-T

     

N. EURASIA

 

NA-DENE

 

ENGLISH

TITTY

YELLOWKNIFE

TTHU, TTHUWE

SPANISH

TETA

CHIPEWYAN

-TTHUWE,

BASQUE

TITI

ALMOSAN-KERESIOUAN

 

SEAS

 

CREE

TO-TOOS

PALAU

TUT

OJIBWA

TOTOHSIMAHK

PHILIP

TUTUQ, TUTUQ

MACRO-CARIB

 

MALAYA

TETEK

ANDOKE

TUTANE

INDONESIA

TETEK

MACRO-PANOAN

 

MARSHALL

ITTUT

GUAICURU

OTE-TE, TE-TE

KUSAIE

TITI

TOBA

TETE, TITI

YAP

THUUTH

ANDEAN

 

TRUK

TTU

CHAYAHUITA

TE'TE'TE

PULUWAT

TUUT

AUSTRALIA

 

WOLEAI

TIUT, TTIU

NYUNGAR2

TITI

MID EAST

 

WATJARRI

TITI

AMHARIC

TUT

PAPUA

 

HARARI

TOT

P-MARIN1

TOTO, TOTO

S. AFRICA

 

P-KATI

TIT KONO

BANTU

TITI

P-AWIN

TUTE

   

Bedamini

TOTO

       

 

The second example is equally remarkable but of a completely different form. Here we show where there are pairs of complex names found in widely separated regions. Here we have arranged the data to show some immediately clear conclusions. In particular we have placed the "African" ME or Middle East and the single S.A. examples in the left hand columns since there are clearly implications regarding the "Out of Africa" thesis. We have also placed the American (Am or Amer) examples on the right. These data are clearly worthy of further analysis in conjunction with many others of our studies but the dominance of the Middle East and the Americas is immediately obvious.

STRIKING PAIRS OF NAMES (All Mid East and S. Africa names in left-hand column, Americas included on right)

Bodypart

Language

Name

Language

Name

Armpit

Tedadaza ME

KILIKILI

Pitta Aust

KILYIKILYI

     

Tagalog Seas

KILKILI

Blood

Nandi ME

KOROTI

Dravidian Seas

KURUTI

Brain

Alya Aust

PAMPU

Macro-Ge Am

POMPA

Chin

Morumadi ME

NYEKEMU

Kwamera Seas

NIKUMU

Clitoris

Mbara ME

MEDEW

Nukuoro Seas

ME DUU

Ear

Aleut Euras

TUTUSIX

Penutian Am

TATSK

Elbow

Hausa ME

KURUWAN

HANNU

Nengone Seas

KAREWANIN

Excrement

Basque Euras

SIMAUR

Andean Amer

SAAMARA

Eye

Ainu Seas

KERUP

Kati Papua

KERAP

Flesh

Yap Seas

QUFIN

Na-Dene Amer

KFWEN

Forehead

Ainu Seas

KEPUTURU

M-Tucan Amer

KAPETIIRO

Hair

Shilluk ME

WENO-WEN

Puluwat Seas

WUNAWUN

Hair

Tedadaza ME

COMOSO

Newari Seas

CIMI-SA

Head

Daly-4 Aust

TEPPL

Andean Amer

TEPEL

Heel

Kalka Aust

WANTU

Andean Amer

WINTU

Heel

Hausa ME

DIDDIGI

Chamorro Seas

DEDEGGO

Heel

Morumadi ME

BEREDE

Macro-Ge Am

BUREADO

Jaw

Breton Euras

KARVAN

Andean Amer

KORUEN

Lip

Morumadi ME

TIMBIRI

Cent Amer Am

TEMBERIA

Lungs

Sanscrit Euras

PUPPHUSA

Paezan Amer

PHOPHOSOA

Mucus

Limba S.A.

NAYELENMA

Victor 1, Aust

NJELINJUG

Penis

Hausa ME

TALOLO

Penutian Amer

TOLOLO

Penis

Hausa ME

GANDARI

Warndar Aust

GINDIR

Penis

Emgan Papua

PONGO

Macr-Carib Am

POENGO

Skull

Basque Euras

BURUHEZUR

Macr-Carib Am

BARUHXORA

Spittle

Andaman Seas

AKWENE

Andean Amer

KOWEN

Thumb

Tagalog Seas

HINLALAKI

Hokan Amer

HINOLEKU

Thumb

Semang Seas

BA CHAS

Palau Seas

BECHOS

Vein

Manchu Euras

DARGIYA

Koita Papua

DOROGEA

Waist

Burji ME

GUNDUREE

Djinang Aust

GANDARR

Womb

Mayi Aust

MIRRIYN

Macr-Carib Am

MURE YEN

 

 

 

DATA COLLECTION AND ORDERING THE NAMES

Collecting and ordering the names so that they could be conveniently analyzed was an onerous task that occupied several years. It was greatly aided by following an orderly approach and constructing computer-based (Microsoft Word) files. It consisted of several parts, first collecting the initial vocabularies in "Source Lists", then collecting the names for each bodypart in "Primary Lists", and then grouping the information in each bodypart Primary List to group names that began with the same initial letter forming our "Working Lists".

Inside each Working list the names beginning with a given letter were then grouped to show two and three syllable "Doubles" and "Triples". The Doubles lists and the included Triples are the basic Working Lists of our analyses.

Source Lists: The first task was to collect the initial "Source Data" for each suitable language. Here we listed the available names for 120 different parts of the body in alphabetic order. Only a small number of vocabularies exist in widely available formal dictionaries and most of this information exists in university libraries in specialized journals that are linguistic or anthropological in scope. The vocabularies were very varied in completeness and different names were sometimes found from different sources. Where subtleties of pronunciation were present they were necessarily ignored and the names were recorded as the best "reasonable" English spelling.

From these Source Lists it was clear that some names were infrequent and a list of the most usual one hundred body parts was selected. All the names were then collected in one hundred lists for each part. Here names were grouped according to similar linguistic and geographical associations.

It should be noted that these files are "open ended" since additional vocabularies can be (and have been) added as desired though carrying all new data forward into other files is a formidable task.

Primary Lists: The hundreds of Source Lists in themselves are inconvenient to use directly and it is necessary to collect all the names for a given bodypart so that they may be conveniently compared and analyzed. Thus we have constructed 100 "Primary Lists" where all the names for a bodypart, say "arm", are brought together. In these lists we group all the names for each of our selected regions, say Northern Eurasia, in the order we have listed above. Thus, for example, all the Indo European names are collected together and readily compared. Similar comparisons can be made for other regions and language groups, for example for "Austronesian" languages of South East Asia.

Working Lists: Inspection of the Primary Lists immediately showed that there were many different names for every bodypart and so are not suitable for seeing if there are similar names in different languages. Consequently we re-grouped all the entries in the Primary Lists according to the initial letter in the name. This was a very tedious but essential task and these are the most important lists in our study since they contain the specific information used in all of our studies.

Here we emphasize that it is the initial consonant that is used, not any initial vowel that might be present. This is because our many studies of words have shown that it is overwhelmingly true that similar words are defined by the sequence of consonants, and that the precise vowel that completes the syllable is usually of little importance. On the lighter side, these facts may be some support for the humorous, if sarcastic, comment attributed to Voltaire that etymology is a science where "the consonants count for little and the vowels for nothing". We agree on the vowels but not on the consonants.

Despite this emphasis we must admit that in some cases the similarity in vowels is so striking that we have included vowel similarities in some striking cases.

Under a given initial letter we have ordered the names according to linguistic groupings so that, for example, all Indo-European names beginning with "k" are listed together.

Here we have found that it is not necessary to list according to every letter of the alphabet and that certain initial letters are more frequently found than some others are. This is a convenient point to recognize the probable linkage between some letters, and we use the well-known fact that when names change they tend to do so in established ways. For example "t" preferably changes to "d" (and vice-versa) rather than to another letter. We also find similar changes between "p" and "d", "m" and "n", and some other letters. In brief we collect the names in thirteen groups where we list the names beginning with "t" and then "d", "m" and "n", "k" and "c" and "q" all pronounced as "k", "g", "c" or "ch", "p" and "b", "s" and "z" grouped together, "l", "r", and finally "j" and "y" grouped as a pair.

Further Grouping of Words; Doubles, Triples and Quads:

Having all names for a given bodypart in a single list is convenient but this is still not sufficient to show the close similarity of many names. For example some lists are cumbersome and unclear and in one extreme case we see that there are 170 names for "nose" that begin with the letter "N". Consequently the last task is to collect together names where the second, third and even fourth consonant (or syllable) is the same.

Thus at the end of each "working list" of identical first letters we collect the examples where the first two consonants are the same, then, as appropriate, we separately collect those with three or even four similar consonants. In short hand terms we describe these as "doubles", "triples", and "quads". As expected the lists become progressively shorter as the names become more complex

Taking the instance of 170 examples of "initial N" for Nose, in about 40 of these the initial "N" is followed by "S" or "Z", while there are three groups of about 20 doubles where the initial "N" is followed by "K", "N", or "R". In other words, roughly half of the names are of a "two-syllable" form, either "N-S", "N-K", "N-N", or "N-R". And within each two-syllable group, there are between three and seven examples of "three-syllable" names of the form "N-S-K", "N-K-R", and "N-R-S". In this case these are no "four-syllable" examples.

ANALYSIS OF THE LISTS:

As we have noted, the basic lists used in our analyses are the "doubles". The numbers of such doubles can vary considerably for different bodyparts. In part this is because certain vocabularies which have a sexual connotation are not known. This may be because some of the earlier researchers may have avoided these topics and also perhaps because women might have a reluctance to discuss these with the male investigators. However, even the shorter bodyparts lists contain more than 20 items and many others have 70 or more. The total number of such links is about 5400 or an average of about 50 per language. Inside these lists are about 1600 triples. In other words, there are about 5400 lists of links that show how the languages of the world are related.

As can be seen the total amount of information contained in these lists is very formidable; in computer terms it is over 100 Megabytes. The reader must appreciate the strength of using a computer to store the information so that one may shuffle the names essentially as one wishes and group and re-group the data as appears necessary to address a given question. However it cannot be emphasized too strongly that the analysis of these data is not simply a matter of "feeding it into the computer". In the writer’s experience there can be major mistakes in uninformed and uncritical acceptance of computer-based data "analysis" simply because the data can be presented in an sophisticated manner. There is no known method of substituting such approaches for good judgement.

Basic Principle: All of our analyses begin by collecting all "doubles" names (that automatically include any Triples or Quads) for all bodyparts and a given language or a given region in a single document. Thus we produce documents entitled as "All Basque Links" or "All Papua Links" that contain all such connections worldwide. Such a procedure would be impracticable without computer-based storage of the data, and even so it takes several days to compile a single "All Language" list and far longer for regional lists.

These documents are analyzed in many different ways depending upon the issue or question that is being addressed and interesting and/or significant points raised by the content of the lists themselves. In general each "All Language" list is used in many different studies.

We have found it useful to summarize the broad linkage of a single language or region in these documents in a semi-standardized listing of the number of links with the major regions (discussed later), that is with the Americas, SEAS, Australia, Papua, the Mid East, and Eurasia. The Americas links are usually complied from those with each of the thirteen language groups defined by Greenberg ("Language in the Americas"). In these lists the total number of links is given but we also make an allowance for the differences between the regions by dividing the total number of links by the number of languages in each region. This is not an unambiguous procedure and is discussed in detail in several of our studies.

Comment on Data: The use of computer based files and the massive presentation of linked names is not simply a matter of "overkill" in showing worldwide linkage between languages since the large number of examples allows analyses that are not possible with a small sample. In particular the magnitude of the links is vitally significant in such issues as the strong linkage of the Americas to all of the languages of the Old World. We believe that the statement that there is a 60% or 70% linkage of a given language with the Americas has far more significance than "there are strong links".

Very importantly with such a large set of data we can compare and analyze linkages between entire regions of the world. For example we have listed all links between Africa and the entire rest of the World, and we have also listed all links of the Americas with the rest of the world. In this manner it is possible to show how languages are linked across the World and address the questions of large-scale migration in a comprehensive manner.

This statement of the important aspects of worth of such a large set of data should not be taken as a statement that we have resolved or understand the overall picture presented by languages. We have cast some light on certain issues, particularly the roles of Africa and the Americas, but in general we have been able to better present the questions rather than supply the answers.

Finally, we cannot emphasize too strongly that we believe that publication of the source data and the hundreds of derived documents is beyond presentation in simple paper form. Apart from the sheer bulk (literally many hundreds or even thousands of pages) of printed pages, even if the printed data was available it could not be usefully viewed, reviewed, and analyzed with printed sources. Useful publication of anything other than summaries is only likely to be practical by including computer readable files together with the printed text.

RETURN TO CONTENTS