Differentiating amino acids from nanopore sequencing

Jiahao Zhang, Jia Meng, Yuxin Zhang

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

Nanopore sequencing of amino acids is a significant breakthrough in molecular biology, biochemistry, and medical diagnostics. Its high sensitivity, specificity, and real-time analytic capability make it well suited to accurately identifying amino acids. A new nanopore, known as Msp-NTA-Ni, has recently pushed the boundaries by allowing accurate differentiation of all 20 proteinogenic amino acids and their post-translational modifications (PTMs). Using the data produced by this nanopore, we examined five features and identified the most informative pairs for classification. We then trained, fine-tuned, and compared several machine learning models, including Random Forest, CatBoost, and SVM. The results indicate that the Random Forest model surpasses the current benchmarks, obtaining a validation accuracy of 99.04%. Our results also show that particular feature combinations, such as the mean and standard deviation, are important for improving model performance, although some pairs of amino acids remain difficult to differentiate.
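The feature-pair screening described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic, only the mean and standard deviation are named in the abstract (the other three feature names are placeholders), and a simple nearest-centroid classifier stands in for the Random Forest, CatBoost, and SVM models so that the example needs only the Python standard library. It is not the authors' pipeline.

```python
import itertools
import math
import random
import statistics

# Hypothetical feature set: the abstract names only mean and standard
# deviation; the other three names are placeholders for illustration.
FEATURES = ["mean", "std", "median", "minimum", "maximum"]


def extract_features(signal):
    """Summarise one event (a list of current samples) as five features."""
    return {
        "mean": statistics.fmean(signal),
        "std": statistics.pstdev(signal),
        "median": statistics.median(signal),
        "minimum": min(signal),
        "maximum": max(signal),
    }


def make_synthetic_events(rng, n_per_class=60, n_samples=50):
    """Synthetic toy data (not real nanopore data): each class has a
    made-up blockade level and noise amplitude."""
    classes = {"A": (0.30, 0.020), "B": (0.30, 0.060), "C": (0.45, 0.020)}
    events = []
    for label, (level, noise) in classes.items():
        for _ in range(n_per_class):
            signal = [rng.gauss(level, noise) for _ in range(n_samples)]
            events.append((extract_features(signal), label))
    rng.shuffle(events)
    return events


def to_point(feats, pair, params):
    """Project an event onto a feature pair, z-scored with training stats."""
    return tuple((feats[n] - params[n][0]) / params[n][1] for n in pair)


def nearest_centroid_accuracy(train, valid, pair):
    """Validation accuracy of a nearest-centroid stand-in classifier."""
    params = {}
    for name in pair:
        values = [feats[name] for feats, _ in train]
        params[name] = (statistics.fmean(values), statistics.pstdev(values) or 1.0)
    grouped = {}
    for feats, label in train:
        grouped.setdefault(label, []).append(to_point(feats, pair, params))
    centroids = {
        label: tuple(statistics.fmean(axis) for axis in zip(*points))
        for label, points in grouped.items()
    }
    correct = 0
    for feats, label in valid:
        point = to_point(feats, pair, params)
        predicted = min(centroids, key=lambda c: math.dist(point, centroids[c]))
        correct += predicted == label
    return correct / len(valid)


def rank_feature_pairs(events, train_fraction=0.7):
    """Score every pair of features on a held-out split, best first."""
    split = int(len(events) * train_fraction)
    train, valid = events[:split], events[split:]
    scores = {
        pair: nearest_centroid_accuracy(train, valid, pair)
        for pair in itertools.combinations(FEATURES, 2)
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    ranking = rank_feature_pairs(make_synthetic_events(random.Random(0)))
    for pair, accuracy in ranking:
        print(f"{pair[0]:>8} + {pair[1]:<8} validation accuracy = {accuracy:.3f}")
```

In this toy setup, classes A and B share a blockade level and differ only in noise, so pairs that include a spread-sensitive feature such as the standard deviation separate them while the mean and median together do not. Swapping the stand-in classifier for a tree ensemble would bring the sketch closer to the models named in the abstract.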

Original language: English
Title of host publication: ICBBT 2024 - Proceedings of the 2024 16th International Conference on Bioinformatics and Biomedical Technology
Publisher: Association for Computing Machinery
Pages: 25-30
Number of pages: 6
ISBN (Electronic): 9798400717666
DOIs
Publication status: Published - 18 Nov 2024
Event: 16th International Conference on Bioinformatics and Biomedical Technology, ICBBT 2024 - Chongqing, China
Duration: 24 May 2024 to 26 May 2024

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 16th International Conference on Bioinformatics and Biomedical Technology, ICBBT 2024
Country/Territory: China
City: Chongqing
Period: 24/05/24 to 26/05/24
