assign POP: An R package for population assignment using genetic, non-genetic, or integrated data in a machine-learning framework

View Researcher's Other Codes

Disclaimer: The provided code links for this paper are external links. Science Nest has no responsibility for the accuracy, legality or content of these links. Also, by downloading this code(s), you agree to comply with the terms of use as set out by the author(s) of the code(s).

Please contact us in case of a broken link from here

Population Assignment using Genetic, Non-Genetic or Integrated Data in a Machine-learning Framework.

Authors Chen, K. Y. et al.
Journal/Conference Name Methods in Ecology and Evolution
Paper Category
Paper Abstract 1.The use of biomarkers (e.g., genetic, microchemical, and morphometric characteristics) to discriminate among and assign individuals to a population can benefit species conservation and management by facilitating our ability to understand population structure and demography. 2.Tools that can evaluate the reliability of large genomic datasets for population discrimination and assignment, as well as allow their integration with non-genetic markers for the same purpose, are lacking. Our R package, assignPOP, provides both functions in a supervised machine-learning framework. 3.assignPOP uses Monte-Carlo and K-fold cross-validation procedures, as well as principal component analysis (PCA), to estimate assignment accuracy and membership probabilities, using training (i.e., baseline source population) and test (i.e., validation) datasets that are independent. A user then can build a specified predictive model based on the relative sizes of these datasets and classification functions, including linear discriminant analysis, support vector machine, naive Bayes, decision tree, and random forest. 4.assignPOP can benefit any researcher who seeks to use genetic or non-genetic data to infer population structure and membership of individuals. assignPOP is a freely available R package under the GPL license, and can be downloaded from CRAN or at A comprehensive tutorial can also be found at
Date of publication 2018
Code Programming Language R

Copyright Researcher 2022