KEY: ENSP* - Homo sapiens (Human) ENSBTA* - Bos taurus (Cow) ENSTTR* - Tursiops truncatus (Dolphin) bmy_* - Balaena mysticetus (Bowhead whale) BACU* - Balaenoptera acutorostrata (Minke whale) Copy from below the line for a valid .msf file. ---------------------------------------------------------------------------------- NoName MSF: 5 Type: P Thu May 15 16:56:13 2014 Check: 00 .. Name: ENSP00000331342/1-1283 Len: 1424 Check: 8291 Weight: 1.00 Name: ENSBTAP00000015513/1-1220 Len: 1424 Check: 2474 Weight: 1.00 Name: bmy_14641T0/1-1372 Len: 1424 Check: 3424 Weight: 1.00 Name: ENSTTRP00000016064/1-1208 Len: 1424 Check: 4941 Weight: 1.00 Name: BACU003050/1-1107 Len: 1424 Check: 4350 Weight: 1.00 // 1 50 ENSP00000331342/1-1283 MSLMVSAGRG LGAVWSPTHV QVTVLQARGL RAKGPGG--- ---TSDAYAV ENSBTAP00000015513/1-1220 MSLAASAGRG PGAVWSPTHV QVTVLQARGL RAKGPGG--- ---TSDAYAV bmy_14641T0/1-1372 GACKCNTCFP YLAKDEQAFL NSNQIKRESL EVRMDGVSLW HQLTSHKRTV ENSTTRP00000016064/1-1208 ---------- ---------- ---------- ----PGG--- ---TSDAYAV BACU003050/1-1107 MSLAASAGRG PGAVWSPTHV QVTVLQARGL RAKGPGG--- ---TSDAYAV 51 100 ENSP00000331342/1-1283 IQVGKEKYAT SVSERSLGA- ---------- -----PVW-- ---REEATFE ENSBTAP00000015513/1-1220 IQVGKEKYAT SVSERSLGA- ---------- -----PVW-- ---REEATFE bmy_14641T0/1-1372 APLESDFFPP WVQKRKLKAL RANCADIFIC SLNKPPCIIY LESREYAFYS ENSTTRP00000016064/1-1208 IQVGKEKYAT SVSERSLGA- ---------- -----PVW-- ---REEATFE BACU003050/1-1107 IQVGKEKYAT SVSERSLGA- ---------- -----PVW-- ---REEATFE 101 150 ENSP00000331342/1-1283 LPSLLSSG-- ---------- ----PAAAAT LQLTV----- ---------- ENSBTAP00000015513/1-1220 LPPLLSAE-- ---------- --AAPAAAAT LQLTV----- ---------- bmy_14641T0/1-1372 RENLVACEKI WIPIKMTSGV TGNPPTQRRN LIARVQDGDL RDMANQGRTT ENSTTRP00000016064/1-1208 LPPLLSAE-- ---------- -AAPAAAAAT LQLTV----- ---------- BACU003050/1-1107 LPPLLSAE-- ---------- --AASAAAAT LQLTV----- ---------- 151 200 ENSP00000331342/1-1283 -------LHR ALLGLDK--- ---------- --------FL GRAEVDLRDL ENSBTAP00000015513/1-1220 -------LHR ALLGLDK--- ---------- --------FL GRAEVDLRGL bmy_14641T0/1-1372 KASRKPGLKR PLVILKNSCP RILKYWCLVF PFVFTYKEFV RKKRQQLFQC ENSTTRP00000016064/1-1208 -------LHR ALLGLDK--- ---------- --------FL GRAEVDLREL BACU003050/1-1107 -------LHR ALLGLDK--- ---------- --------FL GRAEVDLREL 201 250 ENSP00000331342/1-1283 HR-------- ---------- --DQGRRKTQ ---------- ---------- ENSBTAP00000015513/1-1220 HQ-------- ---------- --NQGRRRTQ ---------- ---------- bmy_14641T0/1-1372 NKRFVMFGEP ALYWWLLPCL VFDQENPKPK NSQTVQNRSS CHFPRSWLYF ENSTTRP00000016064/1-1208 HR-------- ---------- --DHGHRKTQ ---------- ---------- BACU003050/1-1107 HR-------- ---------- --DQGHRKTQ ---------- ---------- 251 300 ENSP00000331342/1-1283 ----WYKLKS KPGKKDKERG EIEVDIQFMR NNMTASMFDL SMKDKSRNPF ENSBTAP00000015513/1-1220 ----WYTLKS KPGKKDKERG EIEVDIQFMR NNMTASMFDL SMKDKSRNPF bmy_14641T0/1-1372 VNPVWYTLKS KPGKKNKERG EIEVDIQFMR NNMTASMFDL SMKDKSRNPF ENSTTRP00000016064/1-1208 ----WCTLKS KPGKKNKERG EIEVDIQFMR SNMTASMFDL SMKDKSRNPF BACU003050/1-1107 ----WYTLKS KPGKKNKERG EIEVDIQFMR NNMTASMFDL SMKDKSRNPF 301 350 ENSP00000331342/1-1283 GKLKDKIKGK NKDSGSDTAS AIIPSTTPSV DSDDESVVKD KKKKSKIKTL ENSBTAP00000015513/1-1220 GKLKDKIKGK NKDSASDTVS AIVPHVTPSA DSDDEAPSKD KKKKSKIKTL bmy_14641T0/1-1372 GKLKDKIKGK NKDSASDTVS AIIPSMTPSA DSDDESPSKD KKKKSKLKTL ENSTTRP00000016064/1-1208 GKLKDKIKGK NKDSASDTVS AIIPSVTPSA DSDDESPSKD KKKKSKLKTL BACU003050/1-1107 GKLKDKIKGK NKDSASDTVS AIIPSVTPSA DSDDESPSKD KKKKSKLKTL 351 400 ENSP00000331342/1-1283 LSKSNLQKTP LSQSMSVLPT SKPEKVLLRP GDFQSQW-DE DDNEDESSSA ENSBTAP00000015513/1-1220 FSKSNLQRTP LSQSMSVLPT SKSDKVMLRP GDFESRW--E DDDEDESSSA bmy_14641T0/1-1372 FSRSSLQRTP LSQSMSVLPT AKSDKVLLRP GDFQSQWEDE ENDEDKSSSA ENSTTRP00000016064/1-1208 FSRSNLQRAP LSQSMSVLPT VKSDKVLLRP GDFQSQWEDE DNGEDKSSSA BACU003050/1-1107 FSRSSLQRTP LSQSMSVLPT AKSDKVLLRP GDFQSQWEDE ENDEDKSSSV 401 450 ENSP00000331342/1-1283 SDVMSHKRTA STDLKQLNQV NFTLPKKEGL SFLGGLRSKN DVLSRSNVCI ENSBTAP00000015513/1-1220 SDVLSHKRTA SADPKQLNQV NFSLPKKEGL SFLGGLRSKN DALSRSNVCI bmy_14641T0/1-1372 SDALSHKGTA SVDPKQLNQI NFNPPKKEGL SFLGGLRSKN DILSRSNVCI ENSTTRP00000016064/1-1208 SDVLSHKGTA SVDPKQLNQI HFNPPKKEGL SFLGGLRSKN DTLSRSNVCI BACU003050/1-1107 SDALSHKGTA SVDPKQLNQI NFNPPKKEGL SFLGGLRSKN DILSRSNVCI 451 500 ENSP00000331342/1-1283 NGNHVYLEQP EAKGEIKDSS PSSSPSPKGF RKKHLFSSTE NLAAGSWKEP ENSBTAP00000015513/1-1220 NGNHVYVEPP EAKTETKDVS PSPSPSPQDL RKKRWFSSTE NLASRPWREP bmy_14641T0/1-1372 NGNHVYVEQP EAKSETKDSS PSSSPSPQGL RKKHLFSSTE NLASRPLKEP ENSTTRP00000016064/1-1208 NGNHVYVEQP EAKSETKGSS PASSPSPQGL RKKHLFSSTE NLASRPLKEP BACU003050/1-1107 NGNHVYVEQP EAKSETKDSS PSSTPSPQGL RKKHLFSSTE NLASRPLKEP 501 550 ENSP00000331342/1-1283 AEGGGLSSDR QLSESSTKDS LKSMTLPSYR PAPLVS-GDL RENMAPANSE ENSBTAP00000015513/1-1220 GEAGAVP--- --PESSTKDA LKSMSLPSYQ --PQVS-RDL RENAAPATLE bmy_14641T0/1-1372 GEGGAVP--- --SESSTKDS LKSLSLPSYQ --LLVS-GDL RENVALVTLE ENSTTRP00000016064/1-1208 GEGGAVP--- --SESSTKDS LKSMSLPSYQ --QLVS-GDL RENAALATLE BACU003050/1-1107 GEGGAVP--- --SESSTKDS LKSMSLPSYH --TRVSGGDL RENAVLVTLE 551 600 ENSP00000331342/1-1283 ATKEAKESKK PESRRSSLLS LMTGKKDVAK GSEGENPL-T VPGREKEGML ENSBTAP00000015513/1-1220 AAKETKESKK QENKKPALPF LVPGKKDTAK GSEGESPPAA ASGKEREGAP bmy_14641T0/1-1372 AAKETKDSKK QENKKSSLLS LVTGKKDTAK GSEGESPP-A APGKEREGTP ENSTTRP00000016064/1-1208 AAKETKESKK QENKKSSLLS LVTGKKDTAK GSEGESPP-A APGKEREGTP BACU003050/1-1107 AAKETKESKK QENKKSSLLS LVTGKKDTAK GSEGESPP-A APGKEREGTP 601 650 ENSP00000331342/1-1283 MGVKPGEDAS GPAEDLVRRS EKDTAAVVSR QGSSLNLFED VQITEPEAEP ENSBTAP00000015513/1-1220 TALRAGEHQP GPADDLVKKS DKEAAPVAAG PGRALNPFED EQLPEPEADP bmy_14641T0/1-1372 VAVKAREDQP GPVADPVKRS DKETAAIVSG RGRAPNPFED VQIPEREADP ENSTTRP00000016064/1-1208 VAVRASEDQP GPVDDPVKRS DEETAAIVSG RGRAPNPFED VQIPEPEADP BACU003050/1-1107 VAVKAREDQP GPVADPVKRS DKETAAIVSG RGRAPNPFED VQIPEREADP 651 700 ENSP00000331342/1-1283 ESKSEPRPPI SSPRAPQTRA VKPRLEVSPE AQPTARLPSP TDSPSSLPPL ENSBTAP00000015513/1-1220 EPKAAPVPPV FSPRAPQTRA VKPRPEVSPE ARPAPRPPPS TNSPLFLSTL bmy_14641T0/1-1372 EPKAAPAPPV PSPRAPQTRA VKPRLEVSPE AQPTPRLPPS TQSPLVFSAL ENSTTRP00000016064/1-1208 EPKAAPAPPV PSPRAPQTRA VKPRLEVSPE AQPAPRLPPS TRSPLVFSAL BACU003050/1-1107 EPKAAPAPPV PSPRAPQTRA VKPR------ ---------- ---------- 701 750 ENSP00000331342/1-1283 PSSSGQASVP SELGHGADTQ SSESPSVFSS LSSPIAAPIS TSTPIESWPL ENSBTAP00000015513/1-1220 ASGSGQVPVS SKTGHDSETP SSESPSGSSS ISSPIAAPIS TSTPIKNWPS bmy_14641T0/1-1372 PSSSGQTPVP SKLGRDSETQ SSESPSGSFS LPSPEAAPIS TSTPIEHWPS ENSTTRP00000016064/1-1208 PSSSGQTPIP SKLGHDSEPQ PLDSPPGS-S FCSPVAAPIS TSTPIEHWPP BACU003050/1-1107 ---SGQTPVP SKLGRDSETQ SSESPPGSFS FPSPEAAPIS TSTPIEHWPS 751 800 ENSP00000331342/1-1283 VDRGQAKSEG PPLLPKAELQ TESLTPVPNS GSSALGSLFK QPSFPANKGT ENSBTAP00000015513/1-1220 ADPGQASSEE PPSLLELELE KETQTRVPNI DLHAVGLLSK QTPVPLTGGR bmy_14641T0/1-1372 ADMGEASPKE PPSLLEPELE QESLTQVPST VPCAVGSLSK QTPIPVGKGT ENSTTRP00000016064/1-1208 TDVGEASPEE PPSLLEPELE KESLTQVPST VPCAVGSLSK QTPIPVGKGM BACU003050/1-1107 ADVGEASPKE PPSLLEPELE QESLTQVPST VPCAVGSLSK QTPVPVGKGT 801 850 ENSP00000331342/1-1283 EDSLMGRTRE TGTEKNTSSL ELEESLPEQP ETGRQEEELP RFPCKKQDYS ENSBTAP00000015513/1-1220 EDYSPEK--- ------TSTN DLAQSLRTQP EEGQ-EGGLL GSPWQEQQSD bmy_14641T0/1-1372 EDSPVGK--- ------TGAD DPGQSLQTQP EVGREEEELL GSPPRKQQGA ENSTTRP00000016064/1-1208 EDSPVGK--- ------TSAD DPGQSLRTQP EVGR-EEELL GSPPRKQRGT BACU003050/1-1107 EDSPVGK--- ------TGAD DPGQSLRTQP EVGREEEELL GSPPRKQQGA 851 900 ENSP00000331342/1-1283 -PSSGEAQEV PFALSLSSDG AVSPVGELAA GGDRDLESQA GSLVE----- ENSBTAP00000015513/1-1220 IASSDEAREV ASALAPGRGR RGSPAGKPPS REDPDSA--- ----KDAGGG bmy_14641T0/1-1372 IPSSVEAREV TSATALGRGG MGSPAGKPPR RGASGSAGQS RSPVGDADGG ENSTTRP00000016064/1-1208 IPSSVEAREV TPAAALRGGG TGSPAGKPPR REASCSAGQS RSPVGDADGG BACU003050/1-1107 IPSSVEAREG TPATALGRGG MGSPAGKPPR RGASGSAGPS RSPVGDADGG 901 950 ENSP00000331342/1-1283 SKARDAAEEV APPLPMGASV PSIDSMM-RK LEEMGLNLRK DQKKTKKRVS ENSBTAP00000015513/1-1220 GRLSPAVKEM APRLHTGESA PTPDSAMTQG HEEMDPDARG GGKDTKKQAL bmy_14641T0/1-1372 GGMSPAVKEA VPPVPVGESV PPLDSAM-QS HEEMGLDACE GRKEIEEQVL ENSTTRP00000016064/1-1208 GGMSPAVNEA VPP-PVGESV PPLDSAM-QR HEEMGPDACE GRKEIKGQVL BACU003050/1-1107 GGMSPAVKEA VPPVPVGESV PPLDSAM-QS HEEMGPDACE GRKDIQEQVL 951 1000 ENSP00000331342/1-1283 FSEQLFTEEA VAGAALLVEG HSSCPQELNP AWSVAGNASD GEPPESPHAE ENSBTAP00000015513/1-1220 FSKKLSTEEA GEENPRLVHG DRGVPQEPTP PGATPRNAMD ---------M bmy_14641T0/1-1372 FSEQLSTEGA GEKAPVLVRG DRGDPQELTP QGAAPGNAWD ---------T ENSTTRP00000016064/1-1208 FSEQLSREEA GEEAPVLVRG DRGDPQEVTP QGAAPGNAWD ---------T BACU003050/1-1107 FSEQLSTEGA GEKAPVLVRG DRGDPQELTP QGAAPGNAWD ---------T 1001 1050 ENSP00000331342/1-1283 DSERESVTTP GPATCGAPAS PADHLLLPSQ EESFSEVPMS EASSAKDTPL ENSBTAP00000015513/1-1220 GDSQEGAAVT GP----RLPA LATHTPCPSS KGSFSGGPQP TIGSGRAGPL bmy_14641T0/1-1372 GDSQEGAAVA GP----RPAA PAARPPRPSS KGSCSGGPAH AASSGRDGPL ENSTTRP00000016064/1-1208 GHSQEGAAVA GP----WPAA LAARPPCPSS KGSFLGGPAH AASSGRDGPR BACU003050/1-1107 GDSQEGAAVA GP----RPAA PAAHPPRPSS KGSCSGGPAH AASSGRDGPL 1051 1100 ENSP00000331342/1-1283 FRMEGEDALV TQYQSKASDH EGLLSDPLSD LQLVSDFKSP IMADLNLSLP ENSBTAP00000015513/1-1220 FRSQGDNTRM AWNQSKASDH EGLLSDPLRG LQAPCEAKPP GVAHLDLTLP bmy_14641T0/1-1372 LRSQGDDAQM ARNQIKASDH EGLLSGPLHG LQSACEAKPP ALAHLDLTLP ENSTTRP00000016064/1-1208 LKSQGDDSRM AQNQSKASDH EGLLSDPLHG LQSSCEAKPP ALAHLDLTLP BACU003050/1-1107 LGSQGDDARM ARNQSKASDH EGLLSDPLHG LQSACEAKPP ALAHLDLTLP 1101 1150 ENSP00000331342/1-1283 SIPEVASDDE RIDQVEDDGD QVEDDGETAK SSTLDIGALS LGLVVPCPER ENSBTAP00000015513/1-1220 SIPEVESEDE R-------GD QPEGGGGAAL VAGWGGGALS SSTGPVQLEG bmy_14641T0/1-1372 SIPEVASEDE R-------GD QLEDDRGMAP VAALGGGASS LSTWPARAER ENSTTRP00000016064/1-1208 SIPEVASEDE R-------GD QLEDDRGVAP VAALGGGASS PSTWPAHAER BACU003050/1-1107 SIPEVASEDE R-------GD QFEDGRGVAP VAALGGGASS PSTWPARAER 1151 1200 ENSP00000331342/1-1283 GKGPSGEADR LVLGEGLCDF RLQAPQASVT APSEQTTEFG IHKPHLGKSS ENSBTAP00000015513/1-1220 ERTPSEEAGG SAAG-SLHDP TPG---ATAP ASVEQT---- ---------- bmy_14641T0/1-1372 ERAPSGDAGG S-AG-GLRDP RPGAPRAPAP ASAEQTTALA VQEPQS---- ENSTTRP00000016064/1-1208 ERAPSREADG S-AG-GLREP RPGAPRAPAP ASAEQTTALE VQEPQS---- BACU003050/1-1107 ERAPSGESRG P------ARP QAGSAQSTRP HFCRAG---- ---------- 1201 1250 ENSP00000331342/1-1283 SLDKQLPGPS -GGEEEKPMG NGSPSPPPGT SLDNPVPSPS PSEIFPVTHS ENSBTAP00000015513/1-1220 ------PGPG SNGGKAGPAG DGSPPRPPAA ALGSPVSSSP TPPPLPATHS bmy_14641T0/1-1372 ------PGPG -DSGEAKPVG DGRPGQPPAA ALDSLVSGSP FSEPFPATRS ENSTTRP00000016064/1-1208 ------PGPG -DSGEAKPVG DGRPGQPPAA ALDSLVSGPP FSEPFPATRS BACU003050/1-1107 ------PSLG -GSG------ ---------- ---TTVAGPR LHPVKPM--- 1251 1300 ENSP00000331342/1-1283 FPSSAHSDTH HTSTAESQKK ATAEGSAGRV ENFGKRKPLL QAWVSPSETH ENSBTAP00000015513/1-1220 FTRSAHSDTP HTNTAESQKK ATAEGSAGNV ENPGKRKPLL QARVSPSET- bmy_14641T0/1-1372 FPTSARSDTH HTSTAESQKK AAVEGSAGKV ENSGKRKPLL QAWVSPSDTQ ENSTTRP00000016064/1-1208 FPTYAHSDTH HTSTAESQKK AAVEGSAGKV ENSGKRKPLL QAWVSPSDTQ BACU003050/1-1107 ---------- --NAAATKVA LSSSGTATVI SENLVNEA-- ---------- 1301 1350 ENSP00000331342/1-1283 PVSAQPGAGT GSAKHRLHPV KPMNAMATKV ANCSLGTATI ISENLNNEVM ENSBTAP00000015513/1-1220 ----QPSTGS RPAKHRLHPV KPMNTTAPKV SNSTSGTTAI VSENLINEAG bmy_14641T0/1-1372 PVSAQPSAGS GSAKHRLHPV KPMNAAATKV ALSSSGTATV ISENLVNEAM ENSTTRP00000016064/1-1208 PVSARPSAGS GSAKHXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX BACU003050/1-1107 ---------- ---------- ---------- ---------- ---------M 1351 1400 ENSP00000331342/1-1283 MKKYSPSDPA FAYAQLTHDE LIQLVLKQKE TISKKEFQVR ELEDYIDNLL ENSBTAP00000015513/1-1220 MKKYKPSDPA FAYAQLTHDE LIQLVLKQKE TISKKECQVR QLEDYIDNLL bmy_14641T0/1-1372 MKKYKPSEPE FAYAQLTHDE LIQLVLKQKE AISKKECQVR QLEDYIDNLL ENSTTRP00000016064/1-1208 XXKYKPSEPA FAYAQLTHDE LIQLVLKQKE TISKKECQVR QLEDYIDNLL BACU003050/1-1107 MKKYKPSEPE FAYAQLTHDE LIQLVLKQKE AISKKECQVR QLEDYIDNLL 1401 1450 ENSP00000331342/1-1283 VRVMEETPNI LRIPTQVGKK AGKM ENSBTAP00000015513/1-1220 VRVMEETPNI LRVPSQVGKK AGKM bmy_14641T0/1-1372 VRVMEETPSI LRVPTQVGRK AGKM ENSTTRP00000016064/1-1208 VRVMEETPSI LRVPSQVGKK AGKM BACU003050/1-1107 VRVMEETPSI LRVPSQVGRK AGKM