KEY: ENSP* - Homo sapiens (Human) ENSBTA* - Bos taurus (Cow) ENSTTR* - Tursiops truncatus (Dolphin) bmy_* - Balaena mysticetus (Bowhead whale) BACU* - Balaenoptera acutorostrata (Minke whale) Copy from below the line for a valid .msf file. ---------------------------------------------------------------------------------- NoName MSF: 5 Type: P Thu May 15 18:28:25 2014 Check: 00 .. Name: ENSTTRP00000012561/1-1086 Len: 1477 Check: 3171 Weight: 1.00 Name: BACU003591/1-854 Len: 1477 Check: 3015 Weight: 1.00 Name: bmy_05031T0/1-1199 Len: 1477 Check: 5391 Weight: 1.00 Name: ENSP00000155858/1-1165 Len: 1477 Check: 7399 Weight: 1.00 Name: ENSBTAP00000056352/1-1133 Len: 1477 Check: 4909 Weight: 1.00 // 1 50 ENSTTRP00000012561/1-1086 MQGAGGSRPG SHGGAGDQQG PGLCQGEIEF GGSGKKRGQ- ---------- BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 MQGAGGARPG SHGGAGDRQG PGLCRGEIEF GGSGKKRGKF VKVPSGVAPS ENSP00000155858/1-1165 MQDVQGPRPG SPGDAEDRRE LGLHRGEVNF GGSGKKRGKF VRVPSGVAPS ENSBTAP00000056352/1-1133 MQDAGGTRPG SPRGAGDTWE PSLGRGEIIF GRSGKKRGKF VRVPSGVAPS 51 100 ENSTTRP00000012561/1-1086 ---------- ----PNTLMS LVGEERPFAL KPWLRDVLRK GLVKAAQSTR BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 VLFDLLLAEW HLPAPNLAVS LVGEERPFAL KPWLRDVLRK GLVKAAQSTR ENSP00000155858/1-1165 VLFDLLLAEW HLPAPNLVVS LVGEEQPFAM KSWLRDVLRK GLVKAAQSTG ENSBTAP00000056352/1-1133 VLFDLLLAEW HLPAPNLVVS LVGEERPFAL KPWLRDVLRK GLVKAAQSTG 101 150 ENSTTRP00000012561/1-1086 AWILTSALRV GLARYVGQAV CDHSLASTST KA-------- HMVAIGIASL BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 AWILTSALRV GLARYVGQAV RDHSLASTST EAAWWPSASP HSAACCTSAS ENSP00000155858/1-1165 AWILTSALRV GLARHVGQAV RDHSLASTST KV-------- RVVAVGMASL ENSBTAP00000056352/1-1133 AWILTSALRV GLARYVGQAV RDHSLASTST KA-------- RVVAIGIASL 151 200 ENSTTRP00000012561/1-1086 GRVLHRQLLD DAQEDSPVHY PVDDGGNQGP XXXXXXXXXX XXXXXXXXXX BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 GR-------- -RPGGQPCPL PHGRWWPPGP --PLRPGQR- -----PGRPG ENSP00000155858/1-1165 GRVLHRRILE EAQEDFPVHY PEDDGGSQGP LCSLDSNLSH FILVEPGPPG ENSBTAP00000056352/1-1133 GRVLHRRLLD HAQGDGPVHY PLDDGGSQGP FCPLDNNHSH FILVEPGSPG 201 250 ENSTTRP00000012561/1-1086 XXXXXXXXXX XXXXXXXXXX XXXXXSSSIE IPVLSLLVNG DPSTLERTSR BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 EGSEPTERRL RLEKHISEQR TGYGRTGSIE IPVLCLLVNG DPSTLERISR ENSP00000155858/1-1165 KGDGLTELRL RLEKHISEQR AGYGGTGSIE IPVLCLLVNG DPNTLERISR ENSBTAP00000056352/1-1133 KGDGPTELRL RLEKYISEQR TGYGGTSSIE IPVLCLLVNG DPSTLERISR 251 300 ENSTTRP00000012561/1-1086 AVEHAGPWLI LAGSGGIADM LAALTNQPHL LVPQVAEKKF REKFPSENFS BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 AVEHAGPWLI LAGSGGIADV LAALMNQPHL LVPQVAKKQF RETFPSENFS ENSP00000155858/1-1165 AVEQAAPWLI LVGSGGIADV LAALVNQPHL LVPKVAEKQF KEKFPSKHFS ENSBTAP00000056352/1-1133 AVEHAAPWLI LAGSGGIADV LAALMNQPHL LTPQVAEKQF REKFPSEHFS 301 350 ENSTTRP00000012561/1-1086 WEDVVHWTEL ---------- ---------- -------ILE ALVKAHKSRS BACU003591/1-854 ---------- ---------- ---------- ---------- ---------S bmy_05031T0/1-1199 WEDIVHWTEL LRNITSHPHL LTVHDFEQEG PEELDTVILK AL-------- ENSP00000155858/1-1165 WEDIVRWTKL LQNITSHQHL LTVYDFEQEG SEELDTVILK ALVKACKSHS ENSBTAP00000056352/1-1133 WEDVVRWTGL LQTIMSHRHL LTVHDFEQEG SEELDTVILK ALVKACKSHS 351 400 ENSTTRP00000012561/1-1086 QEAQDCPDEL KLPVAWDRVD IARSEI-NRD VQWKSRDLEE VMMDAPVSSK BACU003591/1-854 QEAQDYLHEL KLAVAWDCVD IARSEILNGD VEWKSRHLEE VMMDALVSNK bmy_05031T0/1-1199 ---------- ---------- ---------- ----SRHLEE VMMDALVSNK ENSP00000155858/1-1165 QEPQDYLDEL KLAVAWDRVD IAKSEIFNGD VEWKSCDLEE VMVDALVSNK ENSBTAP00000056352/1-1133 QEAQDYLDEL KLAVAWDRVD IAKSEIFNGD VEWKSRDLEE VMMDALVSNK 401 450 ENSTTRP00000012561/1-1086 PEFTRLFVDS GANVADF-TY GRLQQLYRSV PPKSLLFHLL QRTHEEGRLT BACU003591/1-854 PQFVRLFVDG GADVADVVT- ---------- ---------- ---------- bmy_05031T0/1-1199 PQFVRLFVDG GADVADFLTY GRLQQLYLSA PPKSLPFHLL RRKREEGWLT ENSP00000155858/1-1165 PEFVRLFVDN GADVADFLTY GRLQELYRSV SRKSLLFDLL QRKQEEARLT ENSBTAP00000056352/1-1133 PEFVRLFVDN GADVADFLTY GRLQQLYRSV APKSLLFDLL QRKHEEGRLT 451 500 ENSTTRP00000012561/1-1086 WAGLGAQQAR E----PPAFS LHEVSRVFRD FMHDACRGLY QAVSCERA-- BACU003591/1-854 ---------- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 LAGLGTQQAR E----PPPFS LHEVSRVLKD FLHDACRGLY QAVSCERAAG ENSP00000155858/1-1165 LAGLGTQQAR EPPAGPPAFS LHEVSRVLKD FLQDACRGFY QD-------- ENSBTAP00000056352/1-1133 LAGLGAQQAR EPPAGPPAFS LHEVSRVLKD FLHDACRGLY QAVSAGRG-- 501 550 ENSTTRP00000012561/1-1086 ---------- -----EQGPV RQPTGQKGLL DLNQRSENPW RDLFLWAALQ BACU003591/1-854 ---------- ---------- ---------- ------ESPW RDLFLWAVLQ bmy_05031T0/1-1199 GRAGPRLRAD TPVPQEQGPA RRPTGQKGLL DLNQRSESPW RDLFLWAVLQ ENSP00000155858/1-1165 GRPGDRRRA- -----EKGPA KRPTGQKWLL DLNQKSENPW RDLFLWAVLQ ENSBTAP00000056352/1-1133 ---------- -----ERGLA RRPSGQKGLL DLNQRSEDPW RDLFLWAVLQ 551 600 ENSTTRP00000012561/1-1086 NLHEMATYFR AM-------- ---------- ------GQEG VAAALAACKI BACU003591/1-854 NRHEMAAYFW AVVRTPGRGP GGGGGEAAAS SDLASQGQEG VAAALATRKI bmy_05031T0/1-1199 NRHEMATYFW TM-------- ---------- ------GQEG VAAALAACRI ENSP00000155858/1-1165 NRHEMATYFW AM-------- ---------- ------GQEG VAAALAACKI ENSBTAP00000056352/1-1133 NRHEMATYFW AM-------- ---------- ------GQEG VAAALAACKI 601 650 ENSTTRP00000012561/1-1086 LKEMSHLEPG AEGGRTLREA KHEQLALDLF S-CYRN---- ----LVRRNR BACU003591/1-854 LKEMSHLETE AEVGRTPREA KYEQLALDLF SECYRNSEER AFALLVRRNR bmy_05031T0/1-1199 LKEMSHLETE AEVGRTPREA KYEQLALG-- --AQPRHEEC TFALLVRRNH ENSP00000155858/1-1165 LKEMSHLETE AEAARATREA KYERLALDLF SECYSNSEAR AFALLVRRNR ENSBTAP00000056352/1-1133 LKEMSHLETE AEVGRTLREA KYEQLALDLF SQCYRHSEER AFALLVRRNR 651 700 ENSTTRP00000012561/1-1086 SCSRATCLHL ATEADTEAFF AHDGVQAFLT EIWWGDVAAG TPILRLLGAF BACU003591/1-854 SWSRTTCLHL ATEADTKAFF AHDGVQAFLT KIWWGDMALG TPILRLLGAF bmy_05031T0/1-1199 SWSRTTCLHL ATEADTKAFF AHDGVQAFLT KIWWGDMASG TPILRLLGAF ENSP00000155858/1-1165 CWSKTTCLHL ATEADAKAFF AHDGVQAFLT RIWWGDMAAG TPILRLLGAF ENSBTAP00000056352/1-1133 CWSRTTCLHL ATEADTKAFF AHDGVQAFLT KIWWGDMASG TPILRLLGAF 701 750 ENSTTRP00000012561/1-1086 LCPGLVYTNL ITF------- ---------- -SEEAPLRTG PEDLQGLGSL BACU003591/1-854 LCPALVYTNL ITF------- ---------- -SEEAPLRTG PEDLQGADSL bmy_05031T0/1-1199 LCPALVFTDL ITFRPVRTER LGGLALHWGW WRHEAPLRTG PEDPQGLDSL ENSP00000155858/1-1165 LCPALVYTNL ITF------- ---------- -SEEAPLRTG LEDLQDLDSL ENSBTAP00000056352/1-1133 FCPALIYTNL IAF------- ---------- -SKEAPLRTG PEDLQELDSL 751 800 ENSTTRP00000012561/1-1086 DTEKRLLCG- ---------- ------P--G SRAEQLAKAP GTQRGRG-RA BACU003591/1-854 DAEKRLDAGG QRQARGSGHP SHRWPWPPSW CRAAELAKAP RTQSSRGPRA bmy_05031T0/1-1199 DAEKRLLCG- ---------- ------P--G SRAAELAKAL RTQSSRGPRA ENSP00000155858/1-1165 DTEKSPLYG- ---------- ------L--Q SRVEELVEAP RAQGDRGPRA ENSBTAP00000056352/1-1133 DTEKSLLCG- ---------- ------LGSR CRAEELSEAL RSERDRGRRA 801 850 ENSTTRP00000012561/1-1086 ASSFTRWRKF WGAPVTVVLG NVVTYFAFLF LFTHVLLVDS RSPPQGLSGA BACU003591/1-854 AFLLTCWRKF WGAPMTVFLG NVVMYFAFLF LFTYVLLVDF RPPPQGLAGA bmy_05031T0/1-1199 AFLLRCWRKF WGAPVTVFLG NVIMYFSFLF LFTYVLLVDF RPPPQGLSGA ENSP00000155858/1-1165 VFLLTRWRKF WGAPVTVFLG NVVMYFAFLF LFTYVLLVDF RPPPQGPSGP ENSBTAP00000056352/1-1133 AFLLTRWRKF WGAPVTVFLG NVVMYFAFLF LFTYVLLVDF RPPPQGPSGS 851 900 ENSTTRP00000012561/1-1086 EVTPYFWVFT LVLEEIRQGF FTDEDTHLGK K-ALYVENNW NKCGMVAMFP BACU003591/1-854 EVTLYFWVFT LVLEESRQ-- ---------- KVTLYVENNR NKCDMVAIFL bmy_05031T0/1-1199 EVTLYFWVFT LVLEEIWQGF FTDEDTRLVK KVTLYVEDNR NKCDMAAIFL ENSP00000155858/1-1165 EVTLYFWVFT LVLEEIRQGF FTDEDTHLVK KFTLYVGDNW NKCDMVAIFL ENSBTAP00000056352/1-1133 EVALYFWVFT LVLEEIRQGF FTDEDTHLVK KLTLYVEDSW NKCDLVAIFL 901 950 ENSTTRP00000012561/1-1086 FVVGVTC--- ---------- ---------- ---------- ---------- BACU003591/1-854 FIVSVTC--- ---------- ---------- ---------- ---------- bmy_05031T0/1-1199 FIVGVTCRSV GPRGLAGAAL LGWWAGLPSG WVCSGPGNPV FVCPVLCVLG ENSP00000155858/1-1165 FIVGVTC--- ---------- ---------- ---------- ---------- ENSBTAP00000056352/1-1133 FIAGVTC--- ---------- ---------- ---------- ---------- 951 1000 ENSTTRP00000012561/1-1086 ---------- ---------- XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX BACU003591/1-854 ---------- ---------- GMLPSVYEAG RTILAIDFMV FTLRLIHIFA bmy_05031T0/1-1199 EAPALSGHQQ PSSLPYPRPV RMLPSVSEAG RTILAIDFMV FTLRLIHTFA ENSP00000155858/1-1165 ---------- ---------- RMLPSAFEAG RTVLAMDFMV FTLRLIHIFA ENSBTAP00000056352/1-1133 ---------- ---------- RMLPWAFEAG RTVLAIDFMV FTLRLIHIFA 1001 1050 ENSTTRP00000012561/1-1086 XXXXXXXXXX XXXXX----- ---------- ---------- ------XXXX BACU003591/1-854 IHKQLGPNIT IVERM----- ---------- ---------- ------MKDV bmy_05031T0/1-1199 IHKQLGPKII IVERMVSPRS WEGSEAEAHC QAGADASLPT RLPPVEMKDV ENSP00000155858/1-1165 IHKQLGPKII VVERM----- ---------- ---------- ------MKDV ENSBTAP00000056352/1-1133 VHKQLGPKII IVERM----- ---------- ---------- ------MKDV 1051 1100 ENSTTRP00000012561/1-1086 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX BACU003591/1-854 FFFLFFLSVC LVAYGVTSQA LLHPHDRRLE WIFRHVLYQP YLQIFGQIPL bmy_05031T0/1-1199 FFFLFFLSVW LVAYGVTSQA LLHPHDRRLE WIFRCVLYRP YLQIIRQIPL ENSP00000155858/1-1165 FFFLFFLSVW LVAYGVTTQA LLHPHDGRLE WIFRRVLYRP YLQIFGQIPL ENSBTAP00000056352/1-1133 FFFLFFLSVW LVAYGVTTQA LLHPHDGRLE WIFRRVLYRP YLQIFGQIPL 1101 1150 ENSTTRP00000012561/1-1086 XXXXXACVNC SMHPPLLEDS PSCPNLDASW LV-----TFL LVTNVLLMNS BACU003591/1-854 DEIDEARGDR SVHPLLLEDS PSCPNLYANR LVIV-----L LVTNVLLVNL bmy_05031T0/1-1199 DEIDEARVNR SVHPLLLEGS PSCPNLYANR LVIVLLVTFL LVTNVLLVNL ENSP00000155858/1-1165 DEIDEARVNC STHPLLLEDS PSCPSLYANW LVILLLVTFL LVTNVLLMNL ENSBTAP00000056352/1-1133 DEIDEARVNC SLHPLLLEGS PSCPNLYANW VVILLLVTFL LVTT------ 1151 1200 ENSTTRP00000012561/1-1086 LIAML--S-- ---------- ---------- ---------- ---------- BACU003591/1-854 LIAMFRCPPC RLAPSTSGAL SSSVSGPDLP GPPDLKLRSP LCPQEGPGNG bmy_05031T0/1-1199 FIATFRAT-- ---------- ---------- ---------- ---------- ENSP00000155858/1-1165 LIAMF--S-- ---------- ---------- ---------- ---------- ENSBTAP00000056352/1-1133 --GPG--P-- ---------- ---------- ---------- ---------- 1201 1250 ENSTTRP00000012561/1-1086 ---------- ---------- ---------- ---------- ---------- BACU003591/1-854 VSPKTVAALQ VGSGGSRARV CEGCSRGGSG RPQRSGEKSQ EEGTAGGLGG bmy_05031T0/1-1199 ---------- ---------- ---------- ---------- ---------- ENSP00000155858/1-1165 ---------- ---------- ---------- ---------- ---------- ENSBTAP00000056352/1-1133 ---------- ---------- ---------- ---------- ---------- 1251 1300 ENSTTRP00000012561/1-1086 ---------- ---------- ----YTFQVV QGNADTFWKF QRYHLIVENH BACU003591/1-854 SPWDSGNSAG SRAVCPPPPT SVRSYTLQVV RGNADTFWKF QRYHLIVKYH bmy_05031T0/1-1199 ---------- ---------- ----RTPSVS SSATTSSWST TSAPPW---- ENSP00000155858/1-1165 ---------- ---------- ----YTFQVV QGNADMFWKF QRYNLIVEYH ENSBTAP00000056352/1-1133 ---------- ---------- ----GRGQGA AGPGPLCGSP EGLPTM---- 1301 1350 ENSTTRP00000012561/1-1086 ERPAPAPPII LLSHLNLVLK RFFRKEAQQK GPRLERDLPE PLDQKMVTWE BACU003591/1-854 ERPALAPPII LLSHLNLVLK RFFRKEAQQK RRAWRGTCRS P-------WE bmy_05031T0/1-1199 --PRPGPALH ---------- ---------- PPQP----PE P--------- ENSP00000155858/1-1165 ERPALAPPFI LLSHLSLTLR RVFKKEAEHK REHLERDLPD PLDQKVVTWE ENSBTAP00000056352/1-1133 GLPGLGPALR L--------- ---------- PGSPERDLPE PLDQKVVTWE 1351 1400 ENSTTRP00000012561/1-1086 AVQKENFLSE LERRRKDSGE ELPQKTAHRV -----DFVAK YLGGLREQ-- BACU003591/1-854 AVQKENFLSE LKKPRKDSGE ELLRKTAHRA PGPTDTPACP HLQGSPPQQD bmy_05031T0/1-1199 --GAQEVLPE GGPAEAGTPG EGPAGAP--- -----GPEDG HLGGGAEELP ENSP00000155858/1-1165 TVQKENFLSK MEKRRRDSEG EVLRKTAHRV -----DFIAK YLGGLREQ-- ENSBTAP00000056352/1-1133 AVQKENFLSE LEKRRRESQE ELLRKAAHRV -----DVVAK YLGGLREQ-- 1401 1450 ENSTTRP00000012561/1-1086 -KRVERLESQ VDYCTALLSS MADTLARRNS Y--------- WNS-NFSGGN BACU003591/1-854 VARGFAKRNH SLPCPA---P SLDVLGPG-- ---------- ------CSGV bmy_05031T0/1-1199 ERAGEAEEGQ RGGAAA--ED GPQVRGPGPA PPPRWAWQAA WDSRNFSGGN ENSP00000155858/1-1165 EKRIKCLESQ INYCSVLVSS VADVLAQGGG P--------- RSSQHCGEGS ENSBTAP00000056352/1-1133 ERRVRRLESQ VGYCAALLSS VAESLARGSA S--------- WNSQNSSSGN 1451 1500 ENSTTRP00000012561/1-1086 PQASADHRGG LGSREHPEAG RLPSNT- BACU003591/1-854 SSALPDRQSP QDPDQNQPAR TRRADTC bmy_05031T0/1-1199 PQASAARGGG LGSRKHPEAG RLPSNT- ENSP00000155858/1-1165 QLVAADHRGG LDGWEQPGAG QPPSDT- ENSBTAP00000056352/1-1133 PSASADHGGA LGSTECPEAG QPPSNP-