Imputation of Missing Genotypes with Intelegent K-Nearest Neighbore Algorithm

Author(s):

Fatemeh Vanaei , Farhad Ghafouri-Kesbi* , Pouya Zamani , Ahmad Ahmadi

Message:

Article Type:

Research/Original Article (دارای رتبه معتبر)

Abstract:

Introduction and Objective

Genotype imputation in genomic selection schemes has been considered by researchers in recent years because it can reduce the costs of genomic selection without having a negative impact on the accuracy of genomic selection. In the genotype imputation process, markers that their genotypic information has been missed for any reason are imputed using various statistical methods.

Material and Methods

To constructe genotypic matrix, a one morgan genome including one chromosome for 250 and 1000 individuals was simulated on which in different scenarios 250, 500, 750, 1000, 1500 and 2000 single necleotide polymorphismes (SNP) was distributed. In order to create genomic matrix including missing genotypes, genotypic information of respectively, 5%, 10%, 25%, 50%, 75% and 90% of SNPs was masked and then imputed with KNN. The percent of genotypes correctly imputed (the ratio of genotypes correctly imputed to total masked genotypes) as well as the correlation between primary genotypic matrix (no missing genotype) and imputed genotypic matrix were used as imputation accuracy.

Results

In the population including 250 individuals, the accuracy of imputation in the scenarios of 5%, 10%, 25%, 50%, 75% and 90% missing genotypes, were 0.82, 0.82, 0.80, 0.76, 0.62 and 0.40, respectively, but by increasing the size of the population to 1000 individuals, the imputation accuracies as 0.83, 0.83, 0.82, 0.82, 0.71 and 0.54 were obtained which in the scenarios of 75% and 90% of missing genotypes the increase in imputation accuracy was noticable. The correlation between the primary genotype matrix and the imputed genotypic matrix also decreased with increasing percentage of missing genotypes. In a fixed population size, by increasing the number of SNP from 250 to 2000, imputation accuracy increased from 0.67 to 0.84. In addition, an inverse relationship was observed between MAF and imputation accuracy in a way that by increasing MAF from 0.01 to 0.5, imputation accuracy decreased by 15%. Computation time increased following increase in dimension of genotypic matrix. Bu increasing the percent of missing genotypes, the accuracy of predicted genomic breeding values decreased. In the scenarios of 5 and 10% of missing genotypes, no change in accuracy was observed, but in the scenarios of 75 and 90% of the missing genotypes, the accuracy of prediction of breeding values decreased by 16 and 32%, respectively.

Conclusion

In general, imputation accuracy of KNN was acceptable in such a way that up to 50% of missing genotypes, KNN imputed missing genotypes with 80% accuracy and therefore one could recommend this algorithm for genomic selection schems.

Keywords:

Genotype imputation , K-nearest neighbor , Minor allel frequency , Single nucleotide polymorphism

Language:

Persian

Published:

Research On Animal Production, Volume:13 Issue: 35, 2022

Pages:

130 to 138

https://www.magiran.com/p2457426

دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:

اشتراک شخصی

با ثبت ایمیلتان و پرداخت حق اشتراک سالانه به مبلغ 1,950,000 ريال، بلافاصله متن این مقاله را دریافت کنید.اعتبار دانلود 70 مقاله نیز در حساب کاربری شما لحاظ خواهد شد.

پرداخت حق اشتراک به معنای پذیرش "شرایط خدمات" پایگاه مگیران از سوی شماست.

پست الکترونیکی

اگر مقاله ای از شما در مگیران نمایه شده، برای استفاده از اعتبار اهدایی سامانه نویسندگان با ایمیل منتشرشده ثبت نام کنید. ثبت نام

اشتراک سازمانی

به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!

اطلاعات بیشتر ثبت نام با ایمیل دانشگاهی/سازمانی

توجه!

حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.

In order to view content subscription is required

Personal subscription

Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.

Organization subscription

Please contact us to subscribe your university or library for unlimited access!

More information

سامانه نویسندگان

Author (3)

Pouya Zamani

Professor Department of Animal Science, Bu-Ali Sina University, Hamedan, Iran
Author (4)

Ahmad Ahmadi

Assistant Professor Animal Sciences, Bu-Ali Sina University, Hamedan, Iran

اطلاعات نویسنده(گان) توسط ایشان ثبت و تکمیل شده‌است. برای مشاهده مشخصات و فهرست همه مطالب، صفحه رزومه را ببینید.

مقالات دیگری از این نویسنده (گان)

Effects of Different Sources of Supplemental Zinc on the Performance and Some Blood Parameters of Holstein Suckling Calves
Leyla Cheraghi Mashoof, Hassan Aliarabi*, Daryoush Alipour, Pouya Zamani
Research On Animal Production,
The effect of model structure on the model performance to fit milk production data in Isfahan Holstein cows
Sajad Gholizadeh, Pouya Zamani, Farhad Ghafouri-Kesbi *
Journal of Livestock Science and Technologies, Dec 2023
Semen quality, plasma testosterone, and trace element concentrations in response to dietary supplementation of an organic versus an inorganic source of zinc in Mahabadi bucks
Hamidreza Taghian, Hassan Aliarabi *, Abbas Farahavar, Morteza Yavari, Khalil Zaboli, Ahmad Ahmadi
Journal of Livestock Science and Technologies, Dec 2023
Comparing genomic prediction models for genomic selection of traits with additive and dominance genetic architecture
Seyed Javad Khorami, Farhad Ghafouri-Kesbi *, Ahmad Ahmadi
Journal of Livestock Science and Technologies, Jun 2023

علمی مصوب

فصلنامه پژوهشهای تولیدات دامی

Research On Animal Production

فصلنامه کشاورزی و منابع طبیعی به زبان فارسی و انگلیسی

آخرین شماره | آرشیو

ISSN: 2251-8622 eISSN: 2676-461X

صاحب امتیاز:

دانشگاه علوم کشاورزی و منابع طبیعی ساری

مدیر مسئول:

دکتر منصور رضایی

سردبیر:

دکتر قدرت الله رحیمی میانجی

تلفن نشریه: ۰۱۱-۳۳۶۸۷۴۳۷

اطلاعات بیشتر نشریه

درباره نشریه پیام به نشریه سایت اختصاصی نشریه پذیرش الکترونیکی مقاله راهنمای نویسندگان