Balthasar Bickel's Software Links


--    Randomization tests for typology (R package), based on
        Janssen, Bickel & Zúñiga, 2006. Randomization tests in language typology, Linguistic Typology 10, 419 - 4

--   Genealogical sample algorithm (source code), based on
        Bickel, Balthasar, 2008. A refined sampling procedure for genealogical control. Sprachtypologie und Universalienforschung 61, 221 - 33.

--   R script reading Toolbox files into R (source code)
       
--   Shell scripts transforming CHAT files into Toolbox files, for import into R (two bash scripts in one zip archive: one for CHAT files with headers, one for CHAT files without headers).
       
--   Convenience functions for searching corpora in R (source code). See explanations for use with CPDP corpora.
       
--   Convenience functions for counting words in Toolbox corpora from within R (source code).
       
--   R script for extracting, tabulating and exporting IMDI metadata in various formats (source code).
       
--   Many other convenience functions for working with CPDP corpora and for other purposes (source code).
       

Hints, manuals, etc.

--   E-Z-R: an introduction to R for typologists
       
--   Grep and other shell tools for CPDP users
       
--   Analyzing CPDP corpora in R
       
--   Our wiki with tips on computing MLUs, compiling dictionaries, aggregating data by speaker and recording cycle, etc.