NCBI Reference Sequence: XP_026756396.1
LOCUS XP_026756396 699 aa linear INV 19-DEC-2019
DEFINITION arylphorin subunit alpha-like [Galleria mellonella].
ACCESSION XP_026756396
VERSION XP_026756396.1
DBLINK BioProject:PRJNA498111
DBSOURCE REFSEQ: accession XM_026900595.1
KEYWORDS RefSeq; includes ab initio.
SOURCE Galleria mellonella (greater wax moth)
ORGANISM Galleria mellonella
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata;
Ditrysia; Pyraloidea; Pyralidae; Galleriinae; Galleria.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_022269380.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Galleria mellonella Annotation Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
Annotation Software Version :: 8.3
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 2% of CDS bases
##RefSeq-Attributes-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..699
/organism="Galleria mellonella"
/isolate="Carbio01_MB"
/db_xref="taxon:7137"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole adult"
/geo_loc_name="USA: Urbana-Champaign, Illinois, Berenbaum
Laboratory"
/collection_date="2017-12-13"
/note="laboratory colony"
Protein 1..699
/product="arylphorin subunit alpha-like"
/calculated_mol_wt=83523
CDS 1..699
/gene="LOC113516215"
/coded_by="XM_026900595.1:1..2100"
/db_xref="GeneID:113516215"
ORIGIN
1 mffnlwfhcn svtvyfltey filnnlfavd pnlvniqkkv llllenwkqv dpddeyykig
61 keynieanie sytnrevvte flslyktgft aknqifsiyy enqalevral yrlfyyakdf
121 etfyktaafa rvwlnegqfi yafyiavihr adtrgivlpa pyeiwpeyfv nsdvlakinr
181 iqmqkglilp etaqyygvla kdnayyfyan ysgpwtyenn enllsyfied vawnsyyyyf
241 hsklqfwekg enaigpfker rgeiyyfiyq qilaryyler lsnglgeipr fnwndrlqag
301 yyplltthqi pfaqrngdyy landdniedi qfvdsyektf lqflqkgqfk aykqevdlyn
361 sksvnfvgny wqanvdlyek vpqrnylrsy edaarrilga aprnsyenln vptaldfyqt
421 slrdpafyql yakildfinq ykeylepytq dvlhfvgvki ndvkvdklvt yfeyfdwnat
481 navylseqql dtgspsyivr qprlnnqpft vtidiksdve seavikifig pkydgngypi
541 dlennwvnlv eidwfthklt sgqnkierks enffwfkeds vsvskiyell nngqvprymi
601 ekflllprrl llprgteggv pfqffvfvyp yqapykewep mkefvvdnkp fgypfdrpvt
661 esyyftqpnm yfkdvyiyqe geeypyytsy wsqnqvpkh
//
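
The ORIGIN block stores residues in lowercase groups of ten with position numbers, while the plain sequences further down are uppercase with no numbering. A minimal Python sketch of that conversion follows; the function name `origin_to_sequence` is hypothetical and not part of any NCBI tool. It also checks the record's stated length against the /coded_by range, assuming that range includes the stop codon.

```python
import re

def origin_to_sequence(origin_text: str) -> str:
    """Drop position numbers and spacing from ORIGIN lines; return uppercase residues."""
    residues = []
    for line in origin_text.splitlines():
        line = line.strip()
        # Skip the section keyword, the record terminator, and blank lines.
        if not line or line == "ORIGIN" or line == "//":
            continue
        # Keep letters only (removes the leading position number and the spaces).
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues).upper()

# Last ORIGIN line of the record above (positions 661 to 699).
last_line = " 661 esyyftqpnm yfkdvyiyqe geeypyytsy wsqnqvpkh"
tail = origin_to_sequence(last_line)

# /coded_by covers nucleotides 1..2100: 2100 / 3 = 700 codons.
# Assumption: one of those codons is the stop codon.
expected_protein_length = 2100 // 3 - 1
```

Under that assumption the expected length is 699 residues, which matches the LOCUS line.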

MFFNLWFHCNSVTVYFLTEYFILNNLFAVDPNL
VNIQKKVLLLLENWKQVDPDDEYYKIGKEYNIEANIESYTNRREVVTEFLSLYKTGFTAKNQIFSIYYENQALEVRALYRLFYYAKDFETFYKTAAFARVWLNEGQFIYAFYIAVIHRADTRGIVLPAPYEIWPEYFVNSDVLALKINRIQMQKG
LILPETAQYYGVLAKDNAYYFYANYSGPWTYENNENLLLSYFIEDVAWNSYYYFHSKLQFWEKGENAIGPFKERRGEIYYFIYQQILARYYLERLSNGLGEIPRFNWNDRLQAGYYPLLTTHQIPFAQRNGDYYLANDDNIEDIQFVDSYEKTFLQFLQKGQFKAYKQEVDLYNSKSVNFVGNYWQANVDLYEKVPQRNYLRSYEDAARRILGAAPRNSYENLNVPTALDFYQTSLRDPAFYYQLYAKILDFINQYKEYLEPYTQDVLHFVGVKINDVKVDKLVTYFEYFDWNATNAVYLSEQQLDTGSPSYIVRQPRLNNQPFTVTIDIKSDVESEAIVIKIFIGPKYDGNGYPIDLENNWWNLVEIDWFTHKLTSGQNKIERKSENFFWFKEDSWSVSKIYELLNNGQVPRYMIEKFLLLPRRLLLPRGTEGGVPFQFFVFVYPQYAPYKEWEPMKEFVVDNKPFGYPFDRPVTESYYFTQPNMYFKDVYIYQEGEEYPYYTSWQNQPVKH
MKFKLNLITLALLANTGLAVAGSGSGSGSGSGFFNLWFHCNSVTVYFLTEYFILNNLFAVDPNL GSGSGSGSGSGTATCTSCHAPESIGHGLENMGAIVNGDYVQANQAAQSETCFYCHKPTPTDHTQVKMGSGSGSGSGSGLILPETAQYYGVLAKDNAYYFYANYSGPWTYENNENLLLSYFIEDVAWNSYYYFHSKLQFWEKGENAIGPFKERRGEIYYFIYQQILARYYLERLSNGLGEIPRFNWNDRLQAGYYPLLTTHQIPFAQRNGDYYLANDDNIEDIQFVDSYEKTFLQFLQKGQFKAYKQEVDLYNSKSVNFVGNYWQANVDLYEKVPQRNYLRSYEDAARRILGAAPRNSYENLNVPTALDFYQTSLRDPAFYYQLYAKILDFINQYKEYLEPYTQDVLHFVGVKINDVKVDKLVTYFEYFDWNATNAVYLSEQQLDTGSPSYIVRQPRLNNQPFTVTIDIKSDVESEAIVIKIFIGPKYDGNGYPIDLENNWWNLVEIDWFTHKLTSGQNKIERKSENFFWFKEDSWSVSKIYELLNNGQVPRYMIEKFLLLPRRLLLPRGTEGGVPFQFFVFVYPQYAPYKEWEPMKEFVVDNKPFGYPFDRPVTESYYFTQPNMYFKDVYIYQEGEEYPYYTSWQNQPVKHGSGSGSGSGSGVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK
Fusion construct (protein sequence above, coding DNA below):
N-terminus: MtrB signal peptide (https://www.uniprot.org/uniprotkb/Q8CVD4/entry#sequences)
C-terminus: mCherry (https://www.addgene.org/176016/sequences/)
All segments are joined by the flexible linker GSGSGSGSGSG.
The construct includes the MtrC heme-binding domain.
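
The layout described in these notes can be sketched in Python as follows. The segment strings are copied from the start of the fusion protein sequence; the variable and function names are hypothetical, and the assignment of each segment to a named domain is an assumption read off the linker positions. The `C..CH` pattern is the usual CXXCH heme-binding motif of c-type cytochromes.

```python
import re

LINKER = "GSGSGSGSGSG"  # flexible linker named in the notes

def join_with_linker(parts, linker=LINKER):
    """Concatenate protein segments with the linker between each pair."""
    return linker.join(parts)

def split_on_linker(fusion, linker=LINKER):
    """Recover the segments of a fusion protein by splitting on the linker."""
    return fusion.split(linker)

# First three segments of the fusion protein, as delimited by the linker.
signal_peptide = "MKFKLNLITLALLANTGLAVA"             # assumed: MtrB signal peptide
arylphorin_nterm = "FFNLWFHCNSVTVYFLTEYFILNNLFAVDPNL"  # arylphorin N-terminal segment
heme_segment = "TATCTSCHAPESIGHGLENMGAIVNGDYVQANQAAQSETCFYCHKPTPTDHTQVKM"  # assumed: MtrC

prefix = join_with_linker([signal_peptide, arylphorin_nterm, heme_segment])

# Locate CXXCH heme-binding motifs in the heme segment.
heme_motifs = re.findall(r"C..CH", heme_segment)
```

Splitting on the linker only recovers the original segments when no segment itself contains the linker string, which holds for the segments used here.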
ATGAAGTTTAAACTGAACCTGATTACCCTGGCGCTGCTCGCCAACACGGGCCTTGCCGTCGCAGGGTCCGGCAGTGGCAGCGGATCAGGATCTGGCTTTTTCAATCTGTGGTTTCATTGCAACTCGGTGACGGTGTATTTTCTAACGGAATATTTCATTCTTAACAATCTGTTCGCGGTGGATCCCAACCTAGGTAGTGGGTCGGGCAGCGGTTCTGGGAGTGGCACAGCGACATGTACTTCATGCCACGCACCGGAGTCGATCGGTCATGGTTTGGAAAACATGGGTGCGATTGTCAACGGGGATTATGTTCAGGCAAATCAAGCGGCTCAGTCTGAAACCTGCTTTTACTGTCACAAACCGACTCCCACTGATCATACGCAGGTTAAAATGGGAAGCGGCAGTGGTTCTGGTAGCGGTTCGGGCCTGATACTGCCGGAAACGGCCCAGTATTATGGGGTCTTAGCAAAAGATAATGCGTACTATTTTTATGCAAACTATAGCGGCCCATGGACTTATGAGAACAATGAAAATTTACTCCTGAGCTATTTTATTGAAGACGTGGCTTGGAATTCTTATTACTACTTCCACTCAAAACTTCAGTTCTGGGAGAAAGGCGAAAACGCCATCGGGCCGTTCAAGGAACGCCGCGGAGAAATTTACTACTTTATTTATCAGCAAATTCTGGCGAGGTATTATTTAGAACGCTTGTCGAATGGGCTCGGCGAGATCCCACGTTTTAACTGGAACGATCGGTTGCAGGCTGGTTACTATCCTCTCCTGACGACCCACCAGATTCCGTTTGCGCAACGTAATGGCGATTATTATCTCGCGAATGATGATAATATTGAAGACATCCAATTCGTTGATTCATATGAAAAGACGTTCCTGCAGTTCCTCCAAAAAGGGCAATTCAAAGCGTATAAACAGGAAGTGGATTTATATAACTCCAAATCCGTCAACTTTGTGGGCAATTACTGGCAGGCGAATGTCGACCTGTACGAGAAAGTGCCACAACGAAACTATTTGCGCAGTTACGAAGACGCAGCACGTCGTATCCTGGGTGCCGCCCCCCGTAATAGCTATGAAAATCTGAATGTTCCGACTGCGTTAGATTTTTATCAGACCTCCCTGAGAGACCCGGCATTTTACTACCAACTTTACGCTAAAATCCTGGATTTTATTAACCAGTATAAAGAGTACCTGGAACCGTACACCCAAGACGTGCTGCACTTCGTGGGCGTCAAAATTAATGATGTGAAAGTTGACAAGCTTGTTACCTACTTTGAGTATTTTGACTGGAACGCCACAAACGCAGTGTACCTTAGCGAACAGCAGCTGGATACCGGAAGTCCGAGCTACATTGTGCGTCAGCCGCGCTTGAATAATCAGCCGTTTACAGTAACCATTGATATCAAATCGGATGTGGAAAGTGAAGCCATTGTTATTAAAATTTTCATCGGACCAAAGTACGATGGGAACGGCTACCCGATCGATTTAGAAAATAACTGGTGGAACCTCGTGGAAATCGATTGGTTCACACATAAATTGACATCCGGACAAAACAAAATTGAACGCAAATCCGAAAACTTTTTTTGGTTCAAAGAAGACAGCTGGAGCGTGTCAAAAATTTACGAACTGCTGAATAATGGTCAGGTACCGCGTTACATGATTGAGAAATTCTTACTGCTTCCACGTCGTCTGCTGTTGCCGCGGGGTACCGAAGGCGGTGTTCCTTTCCAGTTCTTTGTATTCGTTTATCCTCAGTATGCTCCTTATAAGGAATGGGAGCCGATGAAAGAGTTTGTTGTCGATAATAAACCTTTTGGTTACCCATTTGACCGCCCTGTAACCGAATCTTATTATTTTACGCAGCCGAATATGTATTTTAAAGATGTATATATATACCAGGAGGGCGAAGAGTATCCCTACTACACCAGCTGGCAGAATCAACCCGTCAAGCATGGGTCGGGCAGCGGCTCAGGCTCAGGGTCGGGTGT
GTCCAAAGGCGAGGAGGACAACATGGCGATCATCAAAGAGTTCATGCGCTTTAAGGTGCATATGGAAGGTTCAGTTAACGGTCACGAATTTGAAATCGAGGGTGAGGGGGAAGGCCGACCGTACGAGGGTACCCAGACTGCGAAACTGAAGGTGACCAAGGGGGGCCCTCTGCCGTTCGCTTGGGACATTTTATCCCCGCAGTTCATGTACGGTAGCAAAGCGTACGTGAAACATCCCGCCGATATCCCGGATTATCTGAAACTGAGTTTTCCGGAGGGTTTTAAATGGGAAAGAGTAATGAACTTTGAAGACGGCGGCGTAGTTACCGTCACTCAGGATAGTAGCTTGCAAGATGGTGAATTTATTTATAAAGTCAAATTGCGGGGCACCAACTTCCCTAGCGACGGCCCGGTTATGCAAAAAAAAACGATGGGCTGGGAAGCTTCGTCTGAACGCATGTATCCGGAAGATGGTGCCCTGAAAGGCGAAATAAAGCAACGTCTAAAACTGAAGGACGGTGGACACTACGATGCGGAGGTCAAAACCACGTATAAGGCCAAAAAACCAGTGCAGCTGCCAGGCGCCTACAACGTAAACATCAAACTGGATATAACGTCCCATAATGAAGACTATACGATCGTTGAGCAGTATGAACGCGCAGAAGGTCGCCATTCTACCGGAGGTATGGACGAATTATATAAGtaa
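
One way to check that the DNA above encodes the fusion protein is to translate it and compare the result with the protein sequence. The following is a minimal Python sketch using the standard genetic code; `translate` is a hypothetical helper, not a library function, and it assumes the reading frame starts at the first base.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks; accept lowercase bases
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

# First six codons and last five codons of the construct DNA above.
start = translate("ATGAAGTTTAAACTGAAC")
end = translate("GAATTATATAAGtaa")
```

Because the DNA is split across two lines, the lines need to be joined before translating; the helper removes whitespace for that reason.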