# HG changeset patch # User rico # Date 1333653015 14400 # Node ID feeeee14729ea238c901847bf32e2d11be69059f # Parent 884befe75dcb15b254b48cb9c2a91b012af7ec02 Deleted selected files diff -r 884befe75dcb -r feeeee14729e evaluate_population_numbers.xml --- a/evaluate_population_numbers.xml Thu Apr 05 15:09:12 2012 -0400 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,54 +0,0 @@ - - possible numbers of populations - - - evaluate_population_numbers.bash "${input.extra_files_path}/admix.ped" "$output" "$max_populations" - - - - - - - - - - - - - - - - - - - - - - - - - - - -**What it does** - -The users selects a set of data generated by the Galaxy tool to "prepare -to look for population structure". For all possible numbers K of ancestral -populations, from 1 up to a user-specified maximum, this tool produces values -that indicate how well the data can be explained as genotypes from individuals -derived from K ancestral populations. These values are computed by a 5-fold -cross-validation procedure, so that a good choice for K will exhibit a low -cross-validation error compared with other potential settings for K. - -**Acknowledgments** - -We use the program "Admixture", downloaded from - -http://www.genetics.ucla.edu/software/admixture/ - -and described in the paper "Fast model-based estimation of ancestry in -unrelated individuals" by David H. Alexander, John Novembre and Kenneth Lange, -Genome Research 19 (2009), pp. 1655-1664. Admixture is called with the "--cv" -flag to produce these values. - -