본문 바로가기
기본적인 개념

[기본개념] 시퀀싱 파일이름 의미!

by 인포메틱스 2020. 6. 8.
반응형

 NCBI에서 유전자들을 찾아볼 때, Reference Sequence를 확인할 수가 있다.

 

여기서 Reference sequencing을 보게되면 NM_머시기라고 되어있는것을 확인할 수가 있다.

 

 

가끔 NM말고도 다른 prefix를 보인는데 다음과 같이 정리하였다.

Accession Prefix Molecule type Comment
AC_ Genomic Complete genomic molecule, usually alternate assembly
NC_ Genomic Complete genomic molecule, usually alternate assembly
NG_ Genomic Incomplete genomic region
NT_ Genomic Contig or scaffold, clone-based or WGS
(Whole Genome Shotgun sequence data)
NW_ Genomic Contig or scaffold, primarily WGS
(Whole Genome Shotgun sequence data)
NZ_  Genomic Complete genomes and unfinished WGS data
NM_ mRNA Protein-coding transcripts (usually curated)
NR_ RNA Non-protein-coding transcripts
XM_ mRNA predicted model protein-coding transcript
XR_ RNA Predicted model non-protein-coding transcript
AP_ Protein Annotated on AC_alternate assembly
NP_ Protein Associated with an NM_ or NC_ accession
YP_ Protein Annotated on genomic molecules without an instantiated transcript recode
XP_ Protein Predicted model, associated with an XM_accession
WP_ Protein Non-redundant across multiple strains and species

 

출처 : ncbi.nlm.nih.gov/books/NBK21091/table/ch18.T.refseq_accession_numbers_and_mole/

 

Table 1. [RefSeq accession numbers and molecule types.]. - The NCBI Handbook - NCBI Bookshelf

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health. McEntyre J, Ostell J, editors. The NCBI Handbook [Internet]. Bethesda (MD): National Center for Biotechnology Information (US); 2002-. This publication is provide

www.ncbi.nlm.nih.gov

 

728x90
반응형

댓글