Data availability and cooperation possibilities
The creation of typological databases involves an enormous amount of work. Based on our workload and funding records, we estimate the total production cost of the AUTOTYP network of databases to be about a million euros (as of 2009). This requires a careful policy on sharing data or data structures/database methods. We are generally following the standards in other disciplines.
Generally accessible datasets
- Genealogy and Geography. Our data on genealogical and geographical information is available for download as a comma-separated plain text file and as a geography file (*.kmz) that can be viewed in Google Earth. In addition, we provide Google Earth files illustrating our geographical area coding (both for larger, continent-sized areas and for smaller areas). Please read the documentation before using the data and make sure to cite them properly, as explained in the documentation. The data can also be accessed through an on-line search interface. The data are continuously updated and there will be updated releases in the future. Suggestions for improvement of our coding is always welcome [mail to bickel at uni-leipzig.de and johanna at berkeley.edu]
2009 release:
On-line Search Interface
- Language Reports. For some (more recent) database modules, you can access language reports. These are filled-out questionnaires, sometimes enriched by new analyses, and are based on our own field research data or on published material. Note that before you can download language reports, you will have to agree to our citation policy.
Published (deposited) datasets
1. World Atlas of Language Structures
The following datasets are published in the World Atlas of Language Structures. Note that they are subsets of our data, exported in 2003. Our current datasets are generally larger. Use of these data requires nothing but the usual reference to our chapters in the Atlas.
2. Electronic appendices to published work
The following datasets are available for download. Their use requires no more than the usual reference to the publication that discusses the data.
| Variable |
From Project |
Number of Languages |
Publication |
| Inclusive/exclusive subtypes |
Person systems |
293 |
Bickel, B. & J. Nichols, 2005. Inclusive/exclusive as person vs. number categories worldwide. In: Elena Filimonova [ed.] Clusivity. Amsterdam: Benjamins |
| Phonological word domains |
Word domains |
70 |
Bickel, B., K. Hildebrandt, & R. Schiering. 2009. The distribution of phonological word domains: a probabilistic typology. In Grijzenhout, J. & B. Kabak (eds.) Phonological Domains: Universals and Deviations. Berlin: Mouton de Gruyter. [DRAFT] |
Other datasets: cooperation projects
If you want to use (i.e. browse or analyze) any other AUTOTYP datasets (data files or definition files), it is possible to get access to the database if you agree to a Code of Conduct on Fair Use that regulates intellectual property rights and coauthorship structures. Please contact Balthasar Bickel or Johanna Nichols if you are interested in this kind of cooperation and we will send you our Code of Conduct on Fair Use.
The AUTOTYP Method
If you want to simply adopt the AUTOTYP method and principles, all we ask you is to refer to Bickel & Nichols 2002 in publications using the new database, a report on the new database and on any publications resulting from it. We will then crossreference your project from our website. (Note: The 2002 paper will be updated soon, so please check back for newer versions.)