EMLSAG00000011843, EMLSAG00000011843-694609 (gene) Lepeophtheirus salmonis

Overview
Name: EMLSAG00000011843
Unique Name: EMLSAG00000011843-694609
Type: gene
Organism: Lepeophtheirus salmonis (salmon louse)
Associated RNAi Experiments

Nothing found

Homology
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:Cpsf100 "Cleavage and polyadenylation specificity factor 100" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS] [GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 OrthoDB:EOG741Z1H GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1 UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 BioGrid:68297 IntAct:Q9V3D6 MINT:MINT-7957520 STRING:7227.FBpp0084726 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357 GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426 FlyBase:FBgn0027873 InParanoid:Q8IML7 PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 PRO:PR:Q9V3D6 Bgee:Q9V3D6 Uniprot:Q9V3D6)

HSP 1 Score: 806.594 bits (2082), Expect = 0.000e+0
Identity = 415/780 (53.21%), Positives = 549/780 (70.38%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPS-PKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK-GSGRTIELEIKKRVELTGTELEEYNKHRDE----LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGK--TGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNV---KQNNE-IKDDRSNIQSEV----------------PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTD--DSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L++DD   LLD GWD  F++   KE+K+    +DAVLLS+PD  HLGALPY VGKLGL+C I+AT+PV KMGQMFMYD+Y +     DFDLF+LD+VD+AF+KITQLKYNQT +LK KG GISITP+ AGHMIGGTIWKIVK GEEDI+YA DFNHKKERHL+GC+LD+L RPS+LITDA+N    Q RRR RDEK+MTNILQT+RNNGNVLI  DTAGRVLELAHM+DQLW+N++SGL+AYSLAL+NN SY+V+EFAKSQIEWMS+KL K FEG RNNPFQFKH++LCHS+ +V K+P+ PKVVLAS PD+ESG++R+LF+QW +N  NSIILT+R+   TLA +L+     G+ IEL++++RV+L G ELEEY + + E    LIVK  +    +  +   D EM +   KHDI+++         P+G+  +GFF+S K    +FP HEEKV + D+YGEI+  +D+  I+ +T    V   +QN E +K +   I +E                 PTK ++ + +  + AQ+Q IDFEGRS+G+S+LK+L Q++PR+VIV+ GT E    +   CEQ           V++P+ GE++DVT+E  IYQVRLTE LV  L++  GKD +++AWVDG + +     ++   +  E+D + +E           D  P+     H +  +NELKLSDFK  L +N I+SEF GGVL+C +G +ALRR D+G++ +EG L  EYY++RELLYEQYAIV
Sbjct:    1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLLSHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQGEKLNPLIVKPDVEEESS-SESEDDIEMSVITGKHDIVVR---------PEGRHHSGFFKSNKRHHVMFPYHEEKV-KCDEYGEIINLDDY-RIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDNDVQLLEKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQ------NVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKD-AEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPI-----HNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAGKVAMEGCLSEEYYKIRELLYEQYAIV 756          
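
Note on the HSP statistics: the percentages on each "Identity = ..." line are the matched-residue counts divided by the aligned length, so for the hit above 415/780 gives 53.21% and 549/780 gives 70.38%. The following is a minimal sketch, not part of the original record, of reading such a line and re-deriving those percentages in Python. The names parse_hsp_stats and percent are hypothetical helpers introduced only for illustration, and the regular expression assumes the line layout shown in this record.

```python
import re

# Pattern for the HSP summary line as laid out in this record (assumed layout).
# "Posi?tives" accepts the label with or without its first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    if ident_len != pos_len:
        raise ValueError("identity and positives use different aligned lengths")
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the record."""
    return round(100.0 * count / length, 2)


# Statistics line of the Drosophila melanogaster Cpsf100 hit above.
stats = parse_hsp_stats(
    "Identity = 415/780 (53.21%), Postives = 549/780 (70.38%), Query Frame = 0"
)
recomputed_identity = percent(stats["identical"], stats["aligned_length"])
recomputed_positives = percent(stats["positives"], stats["aligned_length"])
```

Comparing recomputed_identity and recomputed_positives with the reported identity_pct and positives_pct is a quick consistency check when records like this one are parsed in bulk.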
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:cpsf2 "cleavage and polyadenylation specific factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0006378 GO:GO:0016787 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106 EMBL:BC076029 RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5 STRING:7955.ENSDARP00000088639 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657 InParanoid:Q6DHE5 NextBio:20831102 Bgee:Q6DHE5 Uniprot:Q6DHE5)

HSP 1 Score: 797.734 bits (2059), Expect = 0.000e+0
Identity = 410/804 (51.00%), Positives = 550/804 (68.41%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEY-NKHRDELIVKKSLTSV----LNGGDESS-DDEME----ISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTD--DSADII-----PDEEDDAFE-----------EPSLKKPR----------------------IPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   +K+   ++DAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VDSAFDKI QLKY+Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+IIY VDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+++TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCHS++++ +VPSPKVVL S PD+ESG+SRELFIQWC + KNS+ILT R+   TLA  L+     + IELEI+KR  L G ELEEY  K R +    K L       L+  DES  +D++E    +  K HD++MK +          K GFF+  K  + +FPTHEE+ I+WD+YGEI+R ED+L    Q+TE    K  + + +    ++   S+VPTKC ++  +  I+A++ +ID+EGRS+GDSI K++ Q+KPR++I+V G P+    L   C+  + K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   +D ++LAW+DGV+++  +  D+  I+      DE ++  E           EPS                           IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG  C +YYR+RELLYEQYA+V
Sbjct:    1 MTSIIKLTALSGVQEESALCYLLQVDEFRFLLDCGWDETFSMDIIDSLKRYVHQVDAVLLSHPDHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDSAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKREIHLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRTTPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKERMKKEAAKKLEQAKEVDLDSSDESDMEDDLEQPAVVKTKHHDLMMKGEGGR-------KGGFFKQAKKSYSMFPTHEER-IKWDEYGEIIRPEDFLVPELQATEEEKSKLESGLTNGEEPMEQDLSDVPTKCTSTTQTLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDASQDLAESCKAYSGKD----IKVYIPKLQETVDATSETHIYQVRLKDSLVSSLQFCKARD-TELAWIDGVLDMRVEKVDTGVIVELGEAKDEAEEGGEQGMEVTEELNTEPSTAAAANQRAMKTLFGEDEKEISEESDVIPTLEPLPAHEVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVC-NNLVAVRRTEAGRICLEGCHCDDYYRIRELLYEQYAVV 790          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:Cpsf2 "cleavage and polyadenylation specific factor 2, 100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0006378 GO:GO:0016787 EMBL:CH473982 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 CTD:53981 OrthoDB:EOG741Z1H TreeFam:TF106131 GO:GO:0006398 EMBL:AABR06045956 EMBL:AABR06045957 RefSeq:NP_001100223.1 RefSeq:XP_006240522.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612 GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098 PRO:PR:D3Z9E6 Uniprot:D3Z9E6)

HSP 1 Score: 780.785 bits (2015), Expect = 0.000e+0
Identity = 404/800 (50.50%), Positives = 540/800 (67.50%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALP+AVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G ELEEY +                  D  S DE ++            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKCV++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQSKEADIDSSDESDVEEDVDQPTAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKELGEESEV----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:Cpsf2 "cleavage and polyadenylation specific factor 2" species:10090 "Mus musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG741Z1H TreeFam:TF106131 GO:GO:0006398 EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 RefSeq:NP_058552.1 RefSeq:XP_006516134.1 UniGene:Mm.716 ProteinModelPortal:O35218 SMR:O35218 BioGrid:206172 IntAct:O35218 MINT:MINT-4091947 PhosphoSite:O35218 PaxDb:O35218 PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786 UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 PRO:PR:O35218 ArrayExpress:O35218 Bgee:O35218 CleanEx:MM_CPSF2 Genevestigator:O35218 Uniprot:O35218)

HSP 1 Score: 778.089 bits (2008), Expect = 0.000e+0
Identity = 401/796 (50.38%), Positives = 539/796 (67.71%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSADI-----IPDEEDDAFEEPSLKK-------------PRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALP+AVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   + +  E+E++KRV+L G ELEEY +                  D  S DE ++            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKCV++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D       P +     ++ ++K                IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQSKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEETEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:CPSF2 "Cleavage and polyadenylation specificity factor subunit 2" species:9606 "Homo sapiens" [GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=IDA] [GO:0006366 "transcription from RNA polymerase II promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase II transcription" evidence=TAS] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=IDA] [GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS] [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0031124 "mRNA 3'-end processing" evidence=TAS] Reactome:REACT_71 InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:CH471061 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 eggNOG:COG1236 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG741Z1H TreeFam:TF106131 GO:GO:0006398 EMBL:AK001627 EMBL:BC070095 EMBL:AB037788 EMBL:AL442079 RefSeq:NP_059133.1 RefSeq:XP_005267824.1 UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0 SMR:Q9P2I0 BioGrid:119826 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677 STRING:9606.ENSP00000298875 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0 PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875 GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588 HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0 PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2 GeneWiki:CPSF2 GenomeRNAi:53981 NextBio:56268 PRO:PR:Q9P2I0 ArrayExpress:Q9P2I0 Bgee:Q9P2I0 CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 Uniprot:Q9P2I0)

HSP 1 Score: 777.704 bits (2007), Expect = 0.000e+0
Identity = 405/800 (50.62%), Positives = 540/800 (67.50%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G ELEEY +                  D  S DE +I            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis lupus familiaris" [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0006378 GO:GO:0016787 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 CTD:53981 OrthoDB:EOG741Z1H TreeFam:TF106131 GO:GO:0006398 EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496 Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230 NextBio:20855279 Uniprot:E2R496)

HSP 1 Score: 777.319 bits (2006), Expect = 0.000e+0
Identity = 404/800 (50.50%), Positives = 540/800 (67.50%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G ELEEY +                  D  S DE ++            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDVEEDIDQPSAHKMKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:CPSF2 "Cleavage and polyadenylation specificity factor subunit 2" species:9913 "Bos taurus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=ISS] [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 HOGENOM:HOG000264343 EMBL:X75931 PIR:A56351 RefSeq:NP_787002.1 RefSeq:XP_005222184.1 UniGene:Bt.4077 ProteinModelPortal:Q10568 IntAct:Q10568 STRING:9913.ENSBTAP00000013500 PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689 KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568 OrthoDB:EOG741Z1H TreeFam:TF106131 NextBio:20810154 GO:GO:0006398 Uniprot:Q10568)

HSP 1 Score: 776.933 bits (2005), Expect = 0.000e+0
Identity = 402/800 (50.25%), Positives = 543/800 (67.88%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTG-----TELEEYNKHRDELIVKKSLTSVLNGGDESSD----DEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G        +E  K      +++S  + ++  DES      D+      KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDAEEDIDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:cpsf2 "Cleavage and polyadenylation specificity factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 KO:K14402 InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1 UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394 KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799)

HSP 1 Score: 773.081 bits (1995), Expect = 0.000e+0
Identity = 402/800 (50.25%), Positives = 543/800 (67.88%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIV---------KKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSADI----------------------IPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  L G + E   CYLL+VD++ FLLD GWD  F+  +   +KK   ++DAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LF+LD+VD AFDKI QLKYNQ   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ + RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH  +++ +VPSPKVVLAS PD+E G+SRELFIQWC +PKNS+ILT R+   TLA  L+   S R I++E++KRV+L G ELEEY +                +  L S  +   E   D++     KHD++MKN+          K  FF+  K  +P+FP  E++ I+WD+YGEI++ ED+L    Q TE+   K  + + +    +    S+VPTKCV++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G P+    L   C     K     I VY+P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L D+  D+                      +  ++D  F E S     IP L+  P +    HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC +++++RELLYEQYAIV
Sbjct:    1 MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVDCAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQSKEADLDSSDDSDVEEDIDQITSHKAKHDLMMKNEGSR-------KGSFFKQAKKSYPMFPAPEDR-IKWDEYGEIIKPEDFLVPELQVTEDEKTKLESGLTNGDEPMDQDLSDVPTKCVSTTESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGGKD----IKVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKD-TELAWIDGVLDMRVSKVDTGVILEERELKDEGEDMEMQVDTQVMDASTIAQQKVIKSLFGDDDKEFSEES---EIIPTLEPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVC-NNMVAVRRTETGRIGLEGCLCEDFFKIRELLYEQYAIV 783          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus gallus" [GO:0005847 "mRNA cleavage and polyadenylation specificity factor complex" evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0006378 GO:GO:0016787 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 OrthoDB:EOG741Z1H TreeFam:TF106131 GO:GO:0006398 EMBL:AADN03004841 Ensembl:ENSGALT00000017538 PRO:PR:F1NMN0 Uniprot:F1NMN0)

HSP 1 Score: 770 bits (1987), Expect = 0.000e+0
Identity = 396/800 (49.50%), Positives = 545/800 (68.12%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTG-----TELEEYNKHRDELIVKKSLTSVLNGGDESSD----DEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGV----------------------------INLLTDDSA---------DIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   +KK   ++DAVLLS+PD  HLGALPYAVGK+GL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCHS++++ +VPSPKVVLAS PD+E G+SR+LFIQWC + KNSIILT R+   TLA  L+   S + I++E+++RV+L G        +E  K      +++S  + ++  DES      D+  +   KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++++V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV                            +++ + DS+          +  D++ +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+RELLY+QYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLKKHVHQVDAVLLSHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDAEEDIDQPTVHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCISATESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLVIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELREDEELEMQVDMPSSDSSVIAQQKAMKSLFGDDDKEMCEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNMVAVRRTETGRIGLEGCLCQDFYRIRELLYKQYAIV 782          
BLAST of EMLSAG00000011843 vs. GO
Match: - (symbol:cpsf-2 species:6239 "Caenorhabditis elegans" [GO:0009792 "embryo development ending in birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035 "hermaphrodite genitalia development" evidence=IMP] [GO:0016246 "RNA interference" evidence=IMP] [GO:0040027 "negative regulation of vulval development" evidence=IMP] InterPro:IPR001279 InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0006378 GO:GO:0016787 GO:GO:0003723 Gene3D:3.60.15.10 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402 OMA:FNAGHTL InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027 GeneTree:ENSGT00730000111069 HOGENOM:HOG000264343 OrthoDB:EOG741Z1H EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1 UniGene:Cel.6876 ProteinModelPortal:O17403 SMR:O17403 STRING:6239.F09G2.4 PaxDb:O17403 PRIDE:O17403 EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4 CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938 PRO:PR:O17403 Uniprot:O17403)

HSP 1 Score: 600.897 bits (1548), Expect = 0.000e+0
Identity = 347/856 (40.54%), Positives = 485/856 (56.66%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMK-TFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMT----------KGSGRTIELEIKKRVELTGTELEEYNKHRDE---------------------------LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-------QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVD---------------GVINLLTDD---SADIIPDEEDDAFEEPSLKKP-----------------------------RIPQ-----------LDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+   SG + EGP CYLL+VD    LLD GWD  F  +  +E+K    KI AVL+S+PD  HLG LPY V K GL+  ++ATVPV+KMGQMF+YD+  +    E+F+ +TLD+VD+AF+K+ Q+KYNQT  LKG   G+  T +PAGHM+GG+IW+I +   EDI+Y VDFNHKKERHLNGC  D   RP +LIT A +I   Q+RR+ RDE+++T IL+T+R  G+ +I  DTAGRVLELAH++DQLW N D+GL  Y+L ++++ + SVV+FAKSQ+EWM+EKL K      R NPF  KH+ LCHS  E+ +V SPKVVL S  DMESG+SRELF+ WC++P+N +ILT+R  + TLA  L+           K   R I L +KKRV L G EL EY + + E                             +   +    +  D  S D  E      DI+ K D          K  FF++ K  FP+FP  EEKV +WDDYGE+++ ED+  IS       Q+ +   V +  E +++  N      E+PTKCV  K+   +  +I+FI++EG S+G+S  KLL  + PR++IVV G+    D  ++     A  G    + + +PE G ++D + ESFIYQV L+++L+  +++    +G+ LAW+D               G  NL+ DD     D+   EE+ A E     +P                             R  +           LD  P  L   HQA FVN+ KLSDFK +LT  G  +EF  G L    G  ++RR+D+G   +EG+   +YY++R L Y+Q+A++
Sbjct:    1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLISHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVDTAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDSDDDDIAAPIVPRHSEKDFRSFDGSENDAHTFDIMAKWDNQQ-------KASFFKTTKKSFPMFPYIEEKV-KWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVKKREEEEEVYNPNDHVEEMPTKCVEFKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSR---DDTRDLVAYFADSGFDTTM-LKAPEAGALVDASVESFIYQVALSDALLADIQFKEVSEGNSLAWIDARVMEKEAIDNMLAVGTSNLMIDDKNREEDVNDQEENGATEGEGNAEPMEIGENGSQESLAISESGKEVENGHTNDSRTKKGTKGKIRGNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRRNDTGVFQMEGAFTKDYYKLRRLFYDQFAVL 843          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592908501|gb|GAXK01049874.1| (TSA: Calanus finmarchicus comp31353_c1_seq1 transcribed RNA sequence)

HSP 1 Score: 914.835 bits (2363), Expect = 0.000e+0
Identity = 452/762 (59.32%), Positives = 572/762 (75.07%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDD--EMEISGK--------KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSL-KKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCG-DGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSII++ PLSGG  E PHCYLLEVD +N LLD+GWD  F+      + KV  K+DAVLL+YPDL HLGALP AVGKLGLSC ++ATVPV+KMGQMF+YD+Y+A+   EDF LFTLD+VD+ F+KITQLKYNQT  LKGKG+G+++TP+PAGHMIGGTIW+IVKDGEEDI+YAVD+NHKKERHL GCD+++L RPS+LITDAFN    Q RRR+RDE++MTNIL TLRN+GNVL+C DTAGRVLELAHMVDQLW N+DSGLLAYSLAL+NN +++VVEFAKSQIEWMSEKLMK    K+ NPFQFKHLKLCHSM EVNKVP+PKVVLASMPDME G++R+LF+QWC+NPKNS+ILTSRS   TL  DL+T G  RTI +E+++R++L+G ELE++ +       K   + V    D+ SD   EME+  K        KHDI+MK       +  K ++GFF+S K+K+P++P  EEK I++DDYGEI+R ED+L +  S    ++ +  E  ++    + EVPTKCV++  +F I   IQFIDFEGR++G+SI+KL  Q+KPR++I+VRGT E    +K+FC  + + G+    +++ P+NGEV+D TTE FIYQVRL +SL  +L ++  KDG  LAWVDGVI +  D+  DII  + D+   EP+   +P IP L   P D    H   FVNELKLSDFK +LTKNGI SEFQ G L CG    V LRRH+SGR+ IEG L +EYY +R+LLY+QYAIV
Sbjct:   47 MTSIIRLTPLSGGGDESPHCYLLEVDGFNILLDIGWDEKFSPSFITTLSKVVPKVDAVLLTYPDLPHLGALPVAVGKLGLSCPVYATVPVYKMGQMFLYDIYQARHNIEDFTLFTLDDVDATFEKITQLKYNQTVVLKGKGQGLALTPLPAGHMIGGTIWRIVKDGEEDIVYAVDYNHKKERHLPGCDIERLSRPSLLITDAFNTTYTQARRRLRDEQLMTNILATLRNSGNVLVCVDTAGRVLELAHMVDQLWSNKDSGLLAYSLALLNNVAFNVVEFAKSQIEWMSEKLMKVMGEKKANPFQFKHLKLCHSMAEVNKVPAPKVVLASMPDMECGFARDLFLQWCSNPKNSVILTSRSSPGTLGRDLITNGGDRTIPIEVRRRIKLSGLELEQFREKEKSSSSKHHSSLVEEALDDESDSDTEMEVVTKAGDAKAKVKHDIVMK------AETGKKQSGFFKSNKSKYPMYPCVEEK-IKYDDYGEIIRIEDFL-MDTSEPVDDLAEVVEEYEEDVPDKEEVPTKCVSTVQNFQINCGIQFIDFEGRTDGESIMKLTAQLKPRRMILVRGTEENLTAMKDFCSDV-IGGEN---NIFVPKNGEVVDATTERFIYQVRLRDSLFSTLNFNKAKDG-HLAWVDGVIKMTDDERVDIIATDVDEDTAEPTAPAQPVIPVLVPLPDDQVVGHSTNFVNELKLSDFKLVLTKNGIPSEFQAGNLMCGHSSHVQLRRHESGRVMIEGCLSNEYYTIRDLLYQQYAIV 2293          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592769975|gb|GAXK01184593.1| (TSA: Calanus finmarchicus comp135388_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 145.591 bits (366), Expect = 1.627e-35
Identity = 110/407 (27.03%), Positives = 188/407 (46.19%), Query Frame = 0
Query:    5 IKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWD---------PFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYE-AQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVEL-TGTELEEYNKHRD 400
            IK+ PL  G+  G  C LL +   N +LD G           P F+     E   ++  +D V++S+  LDH GALPY    +G +  I+ TVP   +  + + D+ + A  K+ + + FT   +     KI  +  +Q   +  +   + I    AGH++G  +++ VK G + ++Y  D+N   +RHL    +D+  RP +LIT++      +  +R R+   +  +   +   G VLI     GR  EL  +++  W+  +   L   +      +    ++ K  I W +EK+ KTF  +  N F FKH+K        N  P P VV A+   + +G S  +F +WC   KN II+     + T+ + ++  G  R IE E  +  E+    +   ++ H D
Sbjct:   54 IKVTPLGAGQDVGRSCLLLSIGGKNIMLDCGMHMGYSDDRRFPDFSYITTDE--PLSEHLDCVIISHFHLDHCGALPYMTEMIGYNGPIYMTVPTKAIAPILLEDMRKVAVDKKGEQNFFTSAMIKDCTKKIIAVNLHQVVQVDAE---LEIKAYYAGHVLGAAMFQ-VKVGNQSVVYTGDYNMTPDRHLGAAWIDR-CRPDLLITESTYATTVRDSKRCRERDFLKKVHDCIDKGGKVLIPVFALGRAQELCILLETYWERMN---LKCPIYFSAGMTEKANQYYKMFISWTNEKIRKTFVDR--NMFDFKHIKPFDRSYIDN--PGPMVVFATPGMLHAGLSLTIFRRWCGEEKNMIIMPGYCVSGTIGHKILN-GQKR-IEFEKGQVTEVKMSVQYMSFSAHAD 1226          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592906641|gb|GAXK01051734.1| (TSA: Calanus finmarchicus comp29218_c1_seq1 transcribed RNA sequence)

HSP 1 Score: 135.576 bits (340), Expect = 4.987e-32
Identity = 99/372 (26.61%), Positives = 177/372 (47.58%), Query Frame = 0
Query:    2 TSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKV--ASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCH-QLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMT 370
            + ++K+ PL  G+  G  C++++  D   +LD G  P            +  A +ID +L+S+  LDH GALP+ + K       F T     +    + D  +      D  L+T  ++++A DKI  + +++    + +  GI      AGH++G  ++ +   G + I+Y  DF+ +++RHL   ++  + RP +LI ++   G H   +R  R+ +    +   +   G  LI     GR  EL  ++D+ W      L    +   ++ +   +   ++ I  M+EK+ +      NNPF FKH+     ++  + +  P V+LAS   M+SG SRELF  WCT+ KN  I+       TLA  +++
Sbjct: 1137 SDLMKITPLGSGQEVGRSCHIVQFKDKKIMLDCGIHPGLTGMDALPFVDMIEADQIDLLLISHFHLDHAGALPWFLQKTTFRGKCFMTHATKAIFFWLLSDYIKVSNISTDQMLYTDSDLEAAMDKIETINFHE----EKEVAGIKFWCYNAGHVLGAAMFMLEIAGVK-ILYTGDFSREEDRHLMSAEIPNI-RPDVLIVES-TYGTHIHEKREDRENRFTQTVNDIVSRGGRCLIPVFALGRAQELLLILDEYWAAHPE-LSDIPIYYASSLAKKCMAVYQTFINSMNEKIRRQI--AVNNPFVFKHISNLKGIDHFDDI-GPCVILASPGMMQSGLSRELFESWCTDKKNGCIVAGYCVEGTLAKHILS 2219          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592801774|gb|GAXK01152794.1| (TSA: Calanus finmarchicus comp402423_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 70.4774 bits (171), Expect = 1.778e-19
Identity = 37/76 (48.68%), Positives = 50/76 (65.79%), Query Frame = 0
Query:  209 QLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAH-MVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQ 283
            Q RRR RD+++M +I  TLR + +VL+C DT  RVLEL H M+DQ+    D         L+NN +++VVEFAKSQ
Sbjct:    1 QARRRFRDKQLMADIFGTLRMSDSVLVCVDTVARVLELGHNMLDQMRSTED---------LLNNVAFNVVEFAKSQ 201          

HSP 2 Score: 46.9802 bits (110), Expect = 1.778e-19
Identity = 22/53 (41.51%), Positives = 34/53 (64.15%), Query Frame = 0
Query:  162 IVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRV 214
            I + G+   + ++D+NHK ERHL G ++ +L  PS+LIT+AFN   H  +  V
Sbjct:  185 IPESGKNFSVCSMDYNHKNERHLPGSEI*RLS*PSLLITNAFNTTLHTGQEEV 343          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592801773|gb|GAXK01152795.1| (TSA: Calanus finmarchicus comp402423_c0_seq2 transcribed RNA sequence)

HSP 1 Score: 55.8398 bits (133), Expect = 3.962e-15
Identity = 29/68 (42.65%), Positives = 42/68 (61.76%), Query Frame = 0
Query:  209 QLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAH-MVDQLWQNRDSGLLAYSLALVNNFSYS 275
            Q RRR RD+++M +I  TLR + +VL+C DT  RVLEL H M+DQ+    D         L+NN +++
Sbjct:  199 QARRRFRDKQLMADIFGTLRMSDSVLVCVDTVARVLELGHNMLDQMRSTED---------LLNNVAFN 375          

HSP 2 Score: 46.9802 bits (110), Expect = 3.962e-15
Identity = 22/53 (41.51%), Positives = 34/53 (64.15%), Query Frame = 0
Query:  162 IVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRV 214
            I + G+   + ++D+NHK ERHL G ++ +L  PS+LIT+AFN   H  +  V
Sbjct:   57 IPESGKNFSVCSMDYNHKNERHLPGSEI*RLS*PSLLITNAFNTTLHTGQEEV 215          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592758233|gb|GAXK01196180.1| (TSA: Calanus finmarchicus comp752719_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 55.4546 bits (132), Expect = 1.051e-7
Identity = 31/64 (48.44%), Positives = 36/64 (56.25%), Query Frame = 0
Query:  678 HQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRE 741
            H   FVNELKLSDFK +LTKNGI SE Q G L C              ++ EG L  EYY +R+
Sbjct:   20 HSTNFVNELKLSDFKLVLTKNGIPSEVQAGNLMC*QLSRPAEEAQEWFMS-EGCLPHEYYTIRD 208          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592953186|gb|GAXK01005367.1| (TSA: Calanus finmarchicus comp5794816_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 38.1206 bits (87), Expect = 1.892e-2
Identity = 22/52 (42.31%), Positives = 31/52 (59.62%), Query Frame = 0
Query:  439 MNAN-----DVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWL 485
            MN N     +  K + GF +   +KF +FP  EEK I++ DYGEI+R E +L
Sbjct:   61 MNHNTRVKPETGKKQRGFSKQNNSKFSMFPCVEEK-IKYADYGEIIRIEVFL 213          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592850766|gb|GAXK01106778.1| (TSA: Calanus finmarchicus comp36201_c0_seq2 transcribed RNA sequence)

HSP 1 Score: 35.8094 bits (81), Expect = 1.050e-1
Identity = 14/26 (53.85%), Positives = 20/26 (76.92%), Query Frame = 0
Query:  586 DVYSPENGEVLDVTTESFIYQVRLTE 611
            +++ P+ GEV+  TTE FIYQVRL +
Sbjct:    9 NIFEPKTGEVVGATTERFIYQVRLRD 86          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592867026|gb|GAXK01090536.1| (TSA: Calanus finmarchicus comp2518352_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 33.4982 bits (75), Expect = 7.181e-1
Identity = 24/80 (30.00%), Positives = 41/80 (51.25%), Query Frame = 0
Query:  151 AGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNN 230
            +G + GGT+W+++K  EE++I  V        HLN  DL  L +  +L+ D        ++ RV  +K++ +    L NN
Sbjct:  127 SGEIDGGTLWQVLK-VEENLIKNVLL------HLNPVDLINLEKTCVLMRDLI------VQNRVWKQKLLNDFSNMLSNN 327          
BLAST of EMLSAG00000011843 vs. C. finmarchicus
Match: gi|592830207|gb|GAXK01127337.1| (TSA: Calanus finmarchicus comp916720_c0_seq1 transcribed RNA sequence)

HSP 1 Score: 33.4982 bits (75), Expect = 1.209e+0
Identity = 14/30 (46.67%), Positives = 20/30 (66.67%), Query Frame = 0
Query:  510 QSEVPTKCVTSKHSFHIKAQIQFIDFEGRS 539
            + EVP KCV++  ++ I   I FIDF GR+
Sbjct:   20 KEEVPIKCVSTFMNYQINCGI*FIDFGGRT 109          
BLAST of EMLSAG00000011843 vs. L. salmonis peptides
Match: EMLSAP00000011843 (pep:novel supercontig:LSalAtl2s:LSalAtl2s831:281816:286361:1 gene:EMLSAG00000011843 transcript:EMLSAT00000011843 description:"snap_masked-LSalAtl2s831-processed-gene-2.5")

HSP 1 Score: 1556.19 bits (4028), Expect = 0.000e+0
Identity = 750/750 (100.00%), Positives = 750/750 (100.00%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV
Sbjct:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750          
BLAST of EMLSAG00000011843 vs. L. salmonis peptides
Match: EMLSAP00000000010 (pep:novel supercontig:LSalAtl2s:LSalAtl2s1003:5798:7615:-1 gene:EMLSAG00000000010 transcript:EMLSAT00000000010 description:"augustus_masked-LSalAtl2s1003-processed-gene-0.0")

HSP 1 Score: 151.369 bits (381), Expect = 1.045e-38
Identity = 112/412 (27.18%), Positives = 183/412 (44.42%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLK---------KEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDV--YEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVEL-TGTELEEYNKHRD 400
            MT  I++ PL  G+  G  C L+ +   N +LD G    +N + +              +   +DAV++S+  LDH GALPY    +G S  I+ T P   +  + + D+     +RK E  + FT   +     K+  +  +Q   +    E + I    AGH++G  +++ VK G   I+Y  D+N   +RHL    +DK  RP +LIT++      +  +R R+   +  +   +   G VLI     GR  EL  +++  W   +   L   +      +     + K  I W +EK+ KTF  +  N F FK +K        N  P P VV A+   +  G S  +F +WCTN  N II+     A T+ + ++     R +E    K VE+    +   ++ H D
Sbjct:    1 MTEEIRVTPLGAGQDVGRSCLLVSIGGKNLMLDCGMHMGYNDERRFPDFSFIEDTPGAPLTPHLDAVIISHFHLDHCGALPYMTEMVGYSGPIYMTHPTKAIAPILLEDMRRVAVERKGES-NFFTSAMIKECMKKVVAVHLHQVIRVD---ESLEIKAYYAGHVLGAAMFQ-VKVGNRSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTVRDSKRCRERDFLKKVHDCVERGGKVLIPVFALGRAQELCILLETYWDRMN---LKVPIFFSTGLTEKATNYYKMFITWTNEKIRKTFVER--NMFDFKFIKPLDRAYIQN--PGPMVVFATPGMLHGGLSLAIFEEWCTNELNMIIMPGYCVAGTVGHKILN--GTRKLEFSKGKTVEVKMSVQYMSFSAHAD 397          
BLAST of EMLSAG00000011843 vs. L. salmonis peptides
Match: EMLSAP00000007282 (pep:novel supercontig:LSalAtl2s:LSalAtl2s409:129273:131364:-1 gene:EMLSAG00000007282 transcript:EMLSAT00000007282 description:"maker-LSalAtl2s409-augustus-gene-1.22")

HSP 1 Score: 137.117 bits (344), Expect = 1.022e-33
Identity = 98/366 (26.78%), Positives = 176/366 (48.09%), Query Frame = 0
Query:    9 PLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKV--ASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDA-FNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK 371
            PL  G+  G  C+LLE  D   LLD G  P  +         +  A +ID +L+S+  LDH GALP+ + K      +F T     + +  + D  +      +  L++  +++++ +KI  L +++   +KG    I      AGH++G  ++ I   G    +Y  DF+ +++RHL   +   L +P +LI ++ +    H+ +R  R+ +  + +   +   G  LI     GR  EL  ++D+ W      L    +   ++ +   +   ++ +  M+E++ +      NNPF FKH+     ++  + V  P V+LAS   M+SG SRELF  WC++PKN  I+       TLA  ++++
Sbjct:   15 PLGAGQEVGRSCHLLEFKDKRILLDCGIHPGLSGMDALPFVDLIEADEIDLLLVSHFHLDHAGALPWFLEKTTFKGKVFMTHATKAIYRWLLSDYIKVSNISTEQMLYSEQDLEASMEKIQTLNFHEEKEVKG----IRFWAYNAGHVLGAXMFMIEIAGVR-TLYTGDFSREEDRHLMAXEXPSL-KPHVLILESTYGTNIHE-KREDRESRFTSTVHDIVTRGGRCLIPVFALGRAQELLLILDEYWGAHPE-LHEIPIYYASSLAKKCMAVYQTFVNAMNERIRRQI--SVNNPFVFKHISNLKGIDHFDDV-GPCVILASPGMMQSGLSRELFESWCSDPKNGCIVAGYCVEGTLAKHILSE 369          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|18203548|sp|Q9V3D6.1|CPSF2_DROME (RecName: Full=Probable cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 806.594 bits (2082), Expect = 0.000e+0
Identity = 415/780 (53.21%), Positives = 549/780 (70.38%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPS-PKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK-GSGRTIELEIKKRVELTGTELEEYNKHRDE----LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGK--TGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNV---KQNNE-IKDDRSNIQSEV----------------PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTD--DSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L++DD   LLD GWD  F++   KE+K+    +DAVLLS+PD  HLGALPY VGKLGL+C I+AT+PV KMGQMFMYD+Y +     DFDLF+LD+VD+AF+KITQLKYNQT +LK KG GISITP+ AGHMIGGTIWKIVK GEEDI+YA DFNHKKERHL+GC+LD+L RPS+LITDA+N    Q RRR RDEK+MTNILQT+RNNGNVLI  DTAGRVLELAHM+DQLW+N++SGL+AYSLAL+NN SY+V+EFAKSQIEWMS+KL K FEG RNNPFQFKH++LCHS+ +V K+P+ PKVVLAS PD+ESG++R+LF+QW +N  NSIILT+R+   TLA +L+     G+ IEL++++RV+L G ELEEY + + E    LIVK  +    +  +   D EM +   KHDI+++         P+G+  +GFF+S K    +FP HEEKV + D+YGEI+  +D+  I+ +T    V   +QN E +K +   I +E                 PTK ++ + +  + AQ+Q IDFEGRS+G+S+LK+L Q++PR+VIV+ GT E    +   CEQ           V++P+ GE++DVT+E  IYQVRLTE LV  L++  GKD +++AWVDG + +     ++   +  E+D + +E           D  P+     H +  +NELKLSDFK  L +N I+SEF GGVL+C +G +ALRR D+G++ +EG L  EYY++RELLYEQYAIV
Sbjct:    1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLLSHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQGEKLNPLIVKPDVEEESS-SESEDDIEMSVITGKHDIVVR---------PEGRHHSGFFKSNKRHHVMFPYHEEKV-KCDEYGEIINLDDY-RIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDNDVQLLEKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQ------NVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKD-AEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPI-----HNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAGKVAMEGCLSEEYYKIRELLYEQYAIV 756          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|18202027|sp|O35218.1|CPSF2_MOUSE (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 778.089 bits (2008), Expect = 0.000e+0
Identity = 401/796 (50.38%), Positives = 539/796 (67.71%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSADI-----IPDEEDDAFEEPSLKK-------------PRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALP+AVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   + +  E+E++KRV+L G ELEEY +                  D  S DE ++            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKCV++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D       P +     ++ ++K                IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQSKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEETEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|51338827|sp|Q9P2I0.2|CPSF2_HUMAN (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 777.704 bits (2007), Expect = 0.000e+0
Identity = 405/800 (50.62%), Positives = 540/800 (67.50%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEIS---------GKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G ELEEY +                  D  S DE +I            KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|1706103|sp|Q10568.1|CPSF2_BOVIN (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 776.933 bits (2005), Expect = 0.000e+0
Identity = 402/800 (50.25%), Positives = 543/800 (67.88%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTG-----TELEEYNKHRDELIVKKSLTSVLNGGDESSD----DEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSAD----------------------IIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG + E   CYLL+VD++ FLLD GWD  F+  +   ++K   +IDAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LFTLD+VD+AFDKI QLK++Q   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ L RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH ++++ +VPSPKVVLAS PD+E G+SR+LFIQWC +PKNSIILT R+   TLA  L+   S +  E+E++KRV+L G        +E  K      +++S  + ++  DES      D+      KHD++MK +          K  FF+  K  +P+FP  EE+ I+WD+YGEI++ ED+L    Q+TE    K  + + +    +    S+VPTKC+++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G PE    L   C     K     I VY P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L DD  D                      +  D+E +  EE  +    IP L+  P      HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC ++YR+R+LLYEQYAIV
Sbjct:    1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSSDESDAEEDIDQPSAHKTKHDLMMKGEGSR-------KGSFFKQAKKSYPMFPAPEER-IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKD----IKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD-AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI----IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ-VAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|18203567|sp|Q9W799.1|CPSF2_XENLA (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 773.081 bits (1995), Expect = 0.000e+0
Identity = 402/800 (50.25%), Positives = 543/800 (67.88%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIV---------KKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINL---------------LTDDSADI----------------------IPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  L G + E   CYLL+VD++ FLLD GWD  F+  +   +KK   ++DAVLLS+PD  HLGALPYAVGKLGL+C+I+AT+PV+KMGQMFMYD+Y+++   EDF LF+LD+VD AFDKI QLKYNQ   LKGKG G+SITP+PAGHMIGGTIWKIVKDGEE+I+YAVDFNHK+E HLNGC L+ + RPS+LITD+FN    Q RR+ RDE+++TN+L+TLR +GNVLI  DTAGRVLELA ++DQ+W+ +D+GL  YSLAL+NN SY+VVEF+KSQ+EWMS+KLM+ FE KRNNPFQF+HL LCH  +++ +VPSPKVVLAS PD+E G+SRELFIQWC +PKNS+ILT R+   TLA  L+   S R I++E++KRV+L G ELEEY +                +  L S  +   E   D++     KHD++MKN+          K  FF+  K  +P+FP  E++ I+WD+YGEI++ ED+L    Q TE+   K  + + +    +    S+VPTKCV++  S  IKA++ +ID+EGRS+GDSI K++ Q+KPR++I+V G P+    L   C     K     I VY+P+  E +D T+E+ IYQVRL +SLV SL++   KD ++LAW+DGV+++               L D+  D+                      +  ++D  F E S     IP L+  P +    HQ+ F+NE +LSDFK +L + GI +EF GGVL C +  VA+RR ++GRI +EG LC +++++RELLYEQYAIV
Sbjct:    1 MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVDCAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQSKEADLDSSDDSDVEEDIDQITSHKAKHDLMMKNEGSR-------KGSFFKQAKKSYPMFPAPEDR-IKWDEYGEIIKPEDFLVPELQVTEDEKTKLESGLTNGDEPMDQDLSDVPTKCVSTTESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGGKD----IKVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKD-TELAWIDGVLDMRVSKVDTGVILEERELKDEGEDMEMQVDTQVMDASTIAQQKVIKSLFGDDDKEFSEES---EIIPTLEPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVC-NNMVAVRRTETGRIGLEGCLCEDFFKIRELLYEQYAIV 783          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|18201967|sp|O17403.1|CPSF2_CAEEL (RecName: Full=Probable cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 600.897 bits (1548), Expect = 0.000e+0
Identity = 347/856 (40.54%), Positives = 485/856 (56.66%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMK-TFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMT----------KGSGRTIELEIKKRVELTGTELEEYNKHRDE---------------------------LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS-------QSTENSNVKQNNEIKDDRSNIQ---SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVD---------------GVINLLTDD---SADIIPDEEDDAFEEPSLKKP-----------------------------RIPQ-----------LDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+   SG + EGP CYLL+VD    LLD GWD  F  +  +E+K    KI AVL+S+PD  HLG LPY V K GL+  ++ATVPV+KMGQMF+YD+  +    E+F+ +TLD+VD+AF+K+ Q+KYNQT  LKG   G+  T +PAGHM+GG+IW+I +   EDI+Y VDFNHKKERHLNGC  D   RP +LIT A +I   Q+RR+ RDE+++T IL+T+R  G+ +I  DTAGRVLELAH++DQLW N D+GL  Y+L ++++ + SVV+FAKSQ+EWM+EKL K      R NPF  KH+ LCHS  E+ +V SPKVVL S  DMESG+SRELF+ WC++P+N +ILT+R  + TLA  L+           K   R I L +KKRV L G EL EY + + E                             +   +    +  D  S D  E      DI+ K D          K  FF++ K  FP+FP  EEKV +WDDYGE+++ ED+  IS       Q+ +   V +  E +++  N      E+PTKCV  K+   +  +I+FI++EG S+G+S  KLL  + PR++IVV G+    D  ++     A  G    + + +PE G ++D + ESFIYQV L+++L+  +++    +G+ LAW+D               G  NL+ DD     D+   EE+ A E     +P                             R  +           LD  P  L   HQA FVN+ KLSDFK +LT  G  +EF  G L    G  ++RR+D+G   +EG+   +YY++R L Y+Q+A++
Sbjct:    1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLISHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVDTAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDSDDDDIAAPIVPRHSEKDFRSFDGSENDAHTFDIMAKWDNQQ-------KASFFKTTKKSFPMFPYIEEKV-KWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVKKREEEEEVYNPNDHVEEMPTKCVEFKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSR---DDTRDLVAYFADSGFDTTM-LKAPEAGALVDASVESFIYQVALSDALLADIQFKEVSEGNSLAWIDARVMEKEAIDNMLAVGTSNLMIDDKNREEDVNDQEENGATEGEGNAEPMEIGENGSQESLAISESGKEVENGHTNDSRTKKGTKGKIRGNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRRNDTGVFQMEGAFTKDYYKLRRLFYDQFAVL 843          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|229553940|sp|A8XUS3.2|CPSF2_CAEBR (RecName: Full=Probable cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 538.88 bits (1387), Expect = 6.439e-180
Identity = 314/819 (38.34%), Positives = 457/819 (55.80%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMK-TFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMT----------KGSGRTIELEIKKRVELTGTELEEYNKHRDE---------------------------LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEI----KDDRSNIQS------EVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVD---------------GVINLLTDDS----------ADIIPDE--EDD-----AFEE----------------------PSLKKPRIPQ---LDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDG 714
            MTSIIK+   SG + EGP CYLL+VD+   LLD GWD  F  K  +E++    KI AVL+S+PD  HLG LPY V K GL+  ++ TVPV+KMGQMF+YD+  +    E+F  ++LD+VD AF+K+ Q+KYNQT  LKG   G++ T +PAGHMIGG++W+I +   EDIIY VDFNH+K+RHL+GC  D   RP +LIT A +I   Q++R+ RDE+++T IL+T+R  G+ +I  DTAGRVLELA+++DQLW N+D+GL  Y+L ++++ + SVV+FAKSQ+EWM EKL +      R NPF  K++ L HS  E+ K+ SPKVVL S  DME+G+SRELF+ WC + +N +ILT+R  + TLA  L+           +   + + L ++KRV L G EL EY + + E                             +   +   L+  D  S D +E      DI+ K D        + K  FF+S K  FP++P  EEKV +WDDYGE+++ ED+  IS+        ++  +    ++D   + +      E+PTKCV  ++   I  +++FI++EG S+G+S  K+L  + PR++I+V G+ +    L  +      K DQ    + +P   E++D + ESFIYQV L+++L+  +++    +G+ LAW+D               G   L  +DS           D+IP E  +DD     A EE                      P   +P+I     L   P      HQA FVN+ KLS+FK +L   G  +EF  G L    G
Sbjct:    1 MTSIIKLKVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEELRPYIPKISAVLISHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEFQHYSLDDVDMAFEKVEQVKYNQTVVLKGDS-GVNFTAMPAGHMIGGSMWRICRITGEDIIYCVDFNHRKDRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVELAERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYKRRKAERDAEETRIRMERARRQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMAKWD-------NQQKASFFKSTKKSFPMYPYIEEKV-KWDDYGEVIKPEDYTVISKIDMRKGKNKDEPVVVHKREDEEEVYNPNDHDEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLAGLMPRQIIIVHGSRDDTRDLYAYFTDNGFKKDQ----LNTPVANELIDASVESFIYQVSLSDALLAEIQFKEVSEGNSLAWIDARIQEKESIDNMLVAGASQLTIEDSLQEDAVEVVEEDVIPMETFQDDQNKQEASEENVAEGEKSNGQSKENDENASSIPIETQPKIRGTLILTPLPKKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGG 806          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=AtCPSF100; Short=CPSF 100 kDa subunit; AltName: Full=Protein EMBRYO DEFECTIVE 1265; AltName: Full=Protein ENHANCED SILENCING PHENOTYPE 5)

HSP 1 Score: 501.13 bits (1289), Expect = 1.215e-166
Identity = 280/778 (35.99%), Positives = 451/778 (57.97%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFN-IGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEV-NKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDEL---------IVKKSLTSVLNGGDESSDDEMEISGK-KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWL----DISQSTENSNVKQNNEIKDDRSNIQSEV-PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQ-GGVLFCGDGFVALRR--------HDSG--RITIEGSLCSEYYRVRELLYEQYAIV 750
            M + +++ PL G  +E P  YL+ +D +NFL+D GW+  F++ L + + +VAS IDAVLLS+PD  H+GALPYA+ +LGLS  ++AT PVH++G + MYD + ++++  DFDLFTLD++DSAF  + +L Y+Q + L GKGEGI I P  AGHM+GG+IW+I KDG ED+IYAVD+NH+KERHLNG  L   +RP++LITDA++ +  +Q  R+ RD++ +  I + L   GNVL+  DTAGRVLEL  +++Q W  R      Y L  V   S S +++ KS +EWMS+ + K+FE  R+N F  +H+ L  +  ++ N  P PKVVLASM  +E+G++RE+F++W  +P+N ++ T      TLA  L +    + +++ + KRV L G EL  Y + ++ L         +VK+  T   +G D++S + M I  K  HD+I  + P   + +     GF     +  P+FP + +    WDD+GEI+  +D++    D+ +   ++    +  + +  +++  +  P+K ++++    +   +  +D+EGRS+G SI  ++  + P K+++V    E  + LK  C             VY+P+  E +DVT++   Y+V+L+E L+ ++ +    D S++AWVD  +     D   ++P         P    P               H+   V +LK++DFK  L+  G+  EF  GG L CG+ +V LR+          SG  +I IEG LC +YY++R+ LY Q+ ++
Sbjct:    1 MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLLSHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDIDSAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDG-EDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYV---SSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDNSSEPMIIDTKTTHDVIGSHGPAYKDIL---IDGFVPPSSSVAPMFPYY-DNTSEWDDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVSCSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICP------HVYAPQIEETVDVTSDLCAYKVQLSEKLMSNVIFKKLGD-SEVAWVDSEVGKTERDMRSLLP--------MPGAASP---------------HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGE-YVTLRKVGPTGQKGGASGPQQILIEGPLCEDYYKIRDYLYSQFYLL 739          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 498.049 bits (1281), Expect = 1.816e-165
Identity = 278/780 (35.64%), Positives = 437/780 (56.03%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVP-SPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLN---------GGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKT-----GFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSN------IQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRR-HDSG--------RITIEGSLCSEYYRVRELLYEQYAIV 750
            M + +++ PLSG   EGP CYLL VD + FLLD GW    +    + + KVA  IDAVLLS+ D  HLGALPYA+  LGLS  ++AT PV ++G + +YD + ++R+  DFDLFTLD++D+AF  + +LKY+Q   L  KGEGI I P  AGH +GGT+WKI KDG ED++YAVDFNH+KERHLNG  L   +RP++LITDA+N   + + +R +D+  +  +++ L   G+VL+  DTAGRVLE+  +++Q W  R    L Y +  + N S S V++ KS +EWM++ + K+FE  R+N F  K +    + +E+ K+  +PKVVLASM  +E G+S ++F+      KN ++ T +    TLA  L      + +++ + KR+ L G EL+ Y + ++ +  +++L + LN         G +  + D M I     D      P NA     G       GF     +  P+FP   E    WDD+GE++  ED+L   +  +N+ +    +  D   +      +    P+K ++++ +  +K  + ++DFEGRS+G S+  ++  + P K+++V G+ E  + LK  C +         + VY+P+  E +DVT++   Y+V+L+E L+ ++  S      ++AWVD  +   TDD   ++P                       P      H++  V +LKL+DFK  L   G+  EF GG L CG+ ++ LR+  D+G        +I IEG LC +YY++RELLY Q+ ++
Sbjct:    1 MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLLSHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDIDAAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDG-EDVVYAVDFNHRKERHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAGRVLEILLILEQYWAQRH---LIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLGSNAKASDPMVI-----DASTSRKPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFF-ENTSEWDDFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKCSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSK------NSDLHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNV-ISKKLGEHEIAWVDAEVG-KTDDKLTLLP-----------------------PSSTPAAHKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGE-YITLRKIGDAGQKGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 738          
BLAST of EMLSAG00000011843 vs. SwissProt
Match: gi|74858209|sp|Q55BS1.1|CPSF2_DICDI (RecName: Full=Cleavage and polyadenylation specificity factor subunit 2; AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit; Short=CPSF 100 kDa subunit)

HSP 1 Score: 365.155 bits (936), Expect = 1.188e-113
Identity = 190/416 (45.67%), Positives = 269/416 (64.66%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAF--DKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDK-LLRPSILITDAFNIGCHQLRRR--VRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVP-SPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK-----GSGRTIELEIKKRVELTGTELEEYN----KHRDE 401
            M SIIK   LSG + E P CYLLE+DD+  LLD G     +  L + ++KVA KIDAVLLS+ D  H+G LPY VGK GL+ +I+ T PV KMG MF+YD+YE +  QE+F  ++LD +DS F  D+  +L ++Q ++L GKG+GISITP  AGH IG ++WKI K G   I+YA+D+NH+ E HL+   L   +L+PS+LITD+  +      ++   RD+ +   I + LR+ GNVLI  DTAGRVLEL   ++  W +++  L  YS+  +  FS+SV +FA+SQ+E+MS      FE    NPF FKH+K+  S+ E+ ++P + KV+L S  D+E+G+SRELFIQWC++PK  I+ T + P ++LA  L+ +     G G+ IE+    RV LTG EL +Y     K R+E
Sbjct:    1 MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLLSHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNIDSCFGEDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITK-GTYSIVYAIDYNHRNEGHLDSLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQSLFEQINRNLRDGGNVLIPVDTAGRVLELLLCIENYW-SKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKFEQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREE 414          

HSP 2 Score: 122.094 bits (305), Expect = 2.506e-27
Identity = 89/312 (28.53%), Positives = 158/312 (50.64%), Query Frame = 0
Query:  453 RSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFC-EQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADI------IP----DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGR---ITIEGSLCSEYYRVRELLYEQYAIV 750
            +SM   FP F  H    ++W +YGE    +D +       N + K      ++    + E+P K +T      I  +IQ ID+EG S+G SI  ++QQI P K++++RG+ ++  +++N+  E I  KG      +Y P  GE LD+T+++ +Y++ L +SLV +L+ S   D  +++++ G +++L   +  +      IP    +  ++     +        + +        H  +F+ ++KLSD K +L   GI  +F  G+L CG      R  D G    I ++G +  EYY ++ELLY+Q+ IV
Sbjct:  491 QSMITMFPYFEKH----LKWGEYGE--EDDDLI-----LRNQDKKVEEVTMEEDEIQEQEIPKKIITQTLRLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKLVLIRGSEQQSQSIENYVKENIRTKG------IYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSKILD-YEVSYIQGKVDILDGSNVPVLDLIQSIPINNNNNNNNNNNNNNNNNNNNTTMMTTTTTTTNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWRDEDHGGNSIINVDGIISDEYYLIKELLYKQFQIV 784          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: XP_006561140.1 (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Apis mellifera])

HSP 1 Score: 863.218 bits (2229), Expect = 0.000e+0
Identity = 425/759 (55.99%), Positives = 555/759 (73.12%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISG--KKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRS----NIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIP-DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L+VD+   LLD GWD  F+ +  +E+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC NP+NSIILTSR+   TLA DL+ KG  R I LE+K+R++L G ELEEY + +++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI+R ED+       E  + K+N E K + +     I +++PTKC+    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+    + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AWVD +I        D +   E +DA ++      +I  L+  PL+    HQ TF+NELKLSDFK IL K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQEFIRELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQR-KEKLKQEQLKQEQMETADVSSESEDEIEVGGGRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIRPEDYKIAETMPEVDDNKENLETKQEDTAHHPEIPTDIPTKCIQVTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSQRDTEILAQQAQSAGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARDQICRDAVAGTESNDAIDQSD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: XP_006561139.1 (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Apis mellifera])

HSP 1 Score: 863.218 bits (2229), Expect = 0.000e+0
Identity = 425/759 (55.99%), Positives = 555/759 (73.12%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISG--KKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRS----NIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIP-DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L+VD+   LLD GWD  F+ +  +E+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC NP+NSIILTSR+   TLA DL+ KG  R I LE+K+R++L G ELEEY + +++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI+R ED+       E  + K+N E K + +     I +++PTKC+    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+    + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AWVD +I        D +   E +DA ++      +I  L+  PL+    HQ TF+NELKLSDFK IL K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQEFIRELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQR-KEKLKQEQLKQEQMETADVSSESEDEIEVGGGRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIRPEDYKIAETMPEVDDNKENLETKQEDTAHHPEIPTDIPTKCIQVTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSQRDTEILAQQAQSAGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARDQICRDAVAGTESNDAIDQSD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: gb|KFM58192.1| (Cleavage and polyadenylation specificity factor subunit 2, partial [Stegodyphus mimosarum])

HSP 1 Score: 845.884 bits (2184), Expect = 0.000e+0
Identity = 429/772 (55.57%), Positives = 554/772 (71.76%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISG------------------KKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKK----PRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+ PLSG  SE PHCY+L+VD++ FLLD GWD  FN    KE+KK   +IDAVLLSYPD+ HLGALPYAVGK  L C I+AT+PV+KMGQMFMYD+++++   EDFDLFTLD+VD+AFDKI QLKY+QT  LKGKG+GI+ITP+P GHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGC L+   RPS+LITD++N    Q+RRR RDE +MTNIL+TLR+NGNVLI  DT+GRVLEL+HM+DQLW+++DSGL+AYSLAL+NN SY+VVEFAKSQ+EWMSEK+MK FEG+R+NPFQFKH++LCHS+ E++K+P PKVVLASMP ME G+SRELFIQWC N +NSIILTSR    TLA  L+     RTIEL+I++RV L G ELEEY K   E          L      ++ E E++G                   +HD++MK         P+G  GFF+  K  +P+FP  EEK I+WDDYGEI++ ED++ +  +      K+N    DD     +E+PTKC++   +  IKA +QFIDFEGRS+ +SI K+L  IKPR++I+VRG PE  ++L  +C    ++G      V++P   E++D TTES IYQV+L +SLV SL++   KD  +LAWVDG I +L +D  +I+  + +   +E  +K+     RIP L   P +    H   F+NE+KLSDFK +L ++GI++EF GGVL+C D  VALR+++SG I  EG L  +Y++VRELLYEQYAI+
Sbjct:    1 MTSIIKLLPLSGVYSEDPHCYILQVDEFRFLLDCGWDENFNMTHIKELKKHIHQIDAVLLSYPDILHLGALPYAVGKCNLDCPIYATIPVYKMGQMFMYDLFQSRHNTEDFDLFTLDDVDAAFDKIIQLKYSQTINLKGKGQGITITPLPGGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCILETFNRPSLLITDSYNANYVQVRRRARDELLMTNILKTLRSNGNVLIAVDTSGRVLELSHMMDQLWRSKDSGLMAYSLALLNNVSYNVVEFAKSQVEWMSEKIMKNFEGQRSNPFQFKHVQLCHSLGELSKIPEPKVVLASMPGMECGFSRELFIQWCGNERNSIILTSRGLPGTLARTLIESPEKRTIELQIRRRVRLEGVELEEYLKREKE--------KELEAARHRAEKEAELAGSDSSEESEDELEIDRGGNARHDLMMK-----LEGKPRG-GGFFKQAKKSYPMFPIKEEK-IKWDDYGEIIKLEDYMILEPTNLEEENKENKMENDDSVQDITELPTKCISYMQTLDIKASVQFIDFEGRSDSESIKKILSMIKPRRLIIVRGPPEATESLATYCVSGVVQG-----KVFTPHLLEMVDATTESHIYQVKLKDSLVSSLDFVKSKD-VELAWVDGEI-ILEEDIDEIVAKDAEKEKDEDEIKEEAAIERIPVLQPLPANQIIGHPTIFINEVKLSDFKQVLMRHGINAEFSGGVLYCND-VVALRKNESGHIHFEGCLTEDYFKVRELLYEQYAII 749          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: gb|EFA07272.1| (putative cleavage and polyadenylation specificity factor subunit 2-like Protein [Tribolium castaneum])

HSP 1 Score: 845.499 bits (2183), Expect = 0.000e+0
Identity = 422/761 (55.45%), Positives = 560/761 (73.59%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEME-ISGKKHDIIMKNDPMNANDVPKGKT--GFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQ--SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRID--VYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADI----IPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG   E P CY+L+VD+   LLD GWD  F+ ++ KE+++    IDAVL+SYPD+ HLGALPY VGKLGL+C I+AT+PV+KMGQMFMYD++++    EDFDLFTLD+VD+ F+K+ QLKYNQ+  LKGKG G++ITP+PAGHMIGGTIWKI+K GEEDIIYA DFNHKKERHLNGC+L+KL RPS+ ITDAFN    Q RRR RDEK+MTNILQTLRNNGNVL+  DTAGRVLELAHM+DQLW+N++SGLL YSLAL++N SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHS++E+ KV SPKVVLAS PDMESG+SRELF+QWC+NP NSII+T+R+   TLA DL+  G  R I+L +K+RV+L G+ELEEY K + E   ++  +S     D   D EM  IS  +HDI++K +         GKT  GFF+  K ++PI+P HEEK I+ D+YGEI++ ED+      TE  + K+N  IK +   I   +E P+KC+    +  +  Q+Q+IDFEGRS+G+S++K+L Q++PR+VI+VRG+PE  +T+KN C        Q+ +D  V++P  GEV+D TTE+ IYQVRLT++LV  L +   KD +++AW++  I ++ +   D     + +E  +  EE S      P  D+ P      H   F+NELKLS+FK IL K+ I+SEF GGVL+C +G +A+RR ++GR+ +EG +  +YY+V+ELLYEQYA++
Sbjct:    1 MTSIIKLQALSGAMDESPPCYILQVDEVRILLDCGWDEHFDMEIIKEMRRHVHTIDAVLISYPDVAHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDLFQSHYNMEDFDLFTLDDVDATFEKVIQLKYNQSVPLKGKGYGLTITPLPAGHMIGGTIWKIMKVGEEDIIYANDFNHKKERHLNGCELEKLQRPSLFITDAFNATYQQARRRARDEKLMTNILQTLRNNGNVLVAVDTAGRVLELAHMLDQLWRNKESGLLVYSLALLSNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSLHELQKVSSPKVVLASSPDMESGFSRELFLQWCSNPNNSIIITTRTSPGTLARDLVDNGGNRQIDLVVKRRVKLEGSELEEYQKSQRE--KREENSSRDEESDSDDDIEMSVISKGRHDIVIKQE---------GKTSGGFFKVTKKQYPIYPFHEEK-IKCDEYGEIIKPEDYKLADVVTETEDNKENVVIKKEEEVIPEVAETPSKCIVLSRTVQVNCQVQYIDFEGRSDGESLMKILSQLRPRRVIIVRGSPESTNTIKNHC--------QENLDARVFAPVRGEVVDATTETHIYQVRLTDALVSQLNFQKAKD-AEVAWLNAQI-VVRESQLDARRMNVDNEPMEVDEEESKILTLEPYGDNIP------HDTVFINELKLSEFKQILAKSNINSEFSGGVLWCSNGTLAIRRVETGRVILEGCISEDYYKVKELLYEQYAVL 733          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: EEB18592.1 (Cleavage and polyadenylation specificity factor 100 kDa subunit, putative [Pediculus humanus corporis])

HSP 1 Score: 826.624 bits (2134), Expect = 0.000e+0
Identity = 424/755 (56.16%), Positives = 557/755 (73.77%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEME---ISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQ-SEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPE-NGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK   +SG   E P C++L+VD++ FLLD GWD  F+ +  KE+KK    IDAV+LS+PD  HLGALPY VGK  LSC I+AT+PV+KMGQMFMYD+Y+++   E+FDLFTLD+VD+AFDKI QLKYNQ+ A+KGKG GI+ITP+PAGHMIGG+IWKI K GEEDIIYAVD+NHKKERHLNGC+L+K+ RPS+LITDAFN    Q RRRVRDEK+MTNILQTLR+NGNVL+  DTAGRVLELAHM++QLW+N++SGLLAYSLA +NN SY+ VEFAKSQIEWMSEKLM++FEG RNNPFQFK+++LCHS +E++KVPSPKVVLAS PDMESG+SRELF+QW +NP NSIILTSR+   TLA DL+  G  R I +EIKKRV+L G ELEEY K+ +E   ++     ++     SDDE+E   +S  +HD ++K+         K  +GFF++ K +  +FP +E KV ++DDYGEI+  + +    +  +  +VK  +E  D+   ++  EVPTKC++      IKAQIQFIDFEGRS+G+SI K++ QI+PR++I++RGT E   +L N    I  K    +I  ++P+   EV+D TTE++IYQ+RLT+ L+ SL +  GK+ +++AW+D  + L  + SAD  P EE+         K  I  LD  P++    H+ +++NELKLSDFK IL KN I+ EF GGVL C  G VA+RRH++GR+ +EG L  +YY+V+ELL +QYAIV
Sbjct:    1 MTSIIKFQAISGAMDESPPCFILQVDEFRFLLDCGWDEKFDQEYMKELKKHVPLIDAVILSHPDPLHLGALPYLVGKCSLSCPIYATIPVYKMGQMFMYDLYQSRYNMEEFDLFTLDDVDAAFDKIIQLKYNQSIAMKGKGYGITITPLPAGHMIGGSIWKIFKVGEEDIIYAVDYNHKKERHLNGCELEKIQRPSLLITDAFNATYQQQRRRVRDEKLMTNILQTLRSNGNVLVTVDTAGRVLELAHMLEQLWRNKESGLLAYSLAFLNNVSYNTVEFAKSQIEWMSEKLMRSFEGARNNPFQFKYVQLCHSFSELSKVPSPKVVLASTPDMESGFSRELFLQWSSNPLNSIILTSRTSPGTLARDLIENGGDRIISIEIKKRVKLEGEELEEYFKNEEERREQERENVDVSS---DSDDELEMIQVSKGRHDFLVKDS--------KPHSGFFKTNKKQNAMFPFYEHKV-KFDDYGEIINPDFYKLEGEKEKMDDVK--DEAMDEEERVEDQEVPTKCISYTKEIMIKAQIQFIDFEGRSDGESIQKIISQIRPRRLILIRGTGESTKSLVN----IVSKSTDAKI--FAPQKKSEVVDATTETYIYQIRLTDQLISSLYFQKGKE-AEVAWLDAQV-LTKNRSADARPSEEEME--IDEELKDEILTLDLLPVEDIPGHETSYINELKLSDFKQILNKNNINCEFSGGVLRCCHGSVAVRRHEAGRVILEGCLSEDYYKVKELLCQQYAIV 731          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: EAA08192.4 (AGAP002474-PA [Anopheles gambiae str. PEST])

HSP 1 Score: 822.772 bits (2124), Expect = 0.000e+0
Identity = 440/773 (56.92%), Positives = 556/773 (71.93%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK-GSGRTIELEIKKRVELTGTELEEYNKHRDELIVK--KSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGK--TGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDW--LDISQSTENS-NVKQNNEIK--DDRSNIQSEV-----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQ-IALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGV-------INLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIKM  +SG   E P CY+L+VDD   LLD GWD  F+    KEIKK    IDAVLLSYPD  HLGALPY VGKLGL+C I+AT+PV+KMGQMFMYD++ +     DFDLF+LD+VD+AFDKI QLKYNQ+ A+KGKG GI+ITP+PAGH+IGGTIWKIVK GEEDI+YA DFNHKKERHLNGC+L+KL RPS+LITDA+N    Q RRR RDEK MTNILQTLRNNGNVL+  DTAGRVLELAHM+DQLW+N++SGL+AYSLAL+NN SY+VVEFAKSQIEWMS+KLMK+FEG RNNPF FKHL+LCH+M ++ KVPSPKVVLAS PD+ESG+SRELFIQW  N  NSII+TSRS   TLA DL+   G+GR IE++I++RVEL G ELEEY +   E + +  K      +  D   + EM +   KHDI+++         P+G+  TGFF+S K  + +FP HEEK I++D+YGEI++ +D+  +D+   T    + K+N  IK  D +   + EV     PTKCV S+    + AQ+QFIDFEGRS+G+S+LK+L Q++PR+V+VVRG+P     +   C+Q I  +       V++P  GE++D TTE+ IYQVRLTE+LV  LE+  GKD +++AWVD         I+ +  D  D I    DD  ++  L    + Q D  P      H   F+NELKL DFK IL K+ I+SEF GGVL+C +G VALRR D+GR+TIEG +  +YY++RELLYEQYAI+
Sbjct:    1 MTSIIKMHAISGAMDESPPCYILQVDDVRILLDCGWDEKFDQGFIKEIKKYVHTIDAVLLSYPDGSHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDMFMSHYNMHDFDLFSLDDVDAAFDKIVQLKYNQSVAMKGKGYGITITPLPAGHLIGGTIWKIVKVGEEDIVYATDFNHKKERHLNGCELEKLQRPSLLITDAYNARYQQARRRARDEKFMTNILQTLRNNGNVLVTVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNQSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVPSPKVVLASSPDLESGFSRELFIQWAPNASNSIIITSRSSPGTLARDLIENGGNGRKIEMDIRRRVELEGAELEEYMRTEGEKLNRSIKKRDLDESSSDSDDELEMNVITGKHDIVVR---------PEGRSHTGFFKSSKKNYAMFPFHEEK-IKYDEYGEIIQPDDYRMVDLGPETNGGDDNKENGGIKTEDIKKEKEDEVTVLDKPTKCVQSRKPIEVNAQVQFIDFEGRSDGESLLKILSQLRPRRVVVVRGSPANTSHIAEHCQQNIGAR-------VFTPNRGEIIDATTETHIYQVRLTEALVSQLEFQKGKD-AEVAWVDAQIVIRNKRIDTMEVDDVDTI----DDKMDKQILTLEPLAQEDLPP------HNPVFINELKLIDFKQILMKSNIASEFSGGVLWCSNGTVALRRVDTGRVTIEGCISEDYYKIRELLYEQYAII 745          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: AAF56844.1 (cleavage and polyadenylation specificity factor 100 [Drosophila melanogaster])

HSP 1 Score: 806.594 bits (2082), Expect = 0.000e+0
Identity = 415/780 (53.21%), Positives = 549/780 (70.38%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPS-PKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK-GSGRTIELEIKKRVELTGTELEEYNKHRDE----LIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGK--TGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNV---KQNNE-IKDDRSNIQSEV----------------PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTD--DSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L++DD   LLD GWD  F++   KE+K+    +DAVLLS+PD  HLGALPY VGKLGL+C I+AT+PV KMGQMFMYD+Y +     DFDLF+LD+VD+AF+KITQLKYNQT +LK KG GISITP+ AGHMIGGTIWKIVK GEEDI+YA DFNHKKERHL+GC+LD+L RPS+LITDA+N    Q RRR RDEK+MTNILQT+RNNGNVLI  DTAGRVLELAHM+DQLW+N++SGL+AYSLAL+NN SY+V+EFAKSQIEWMS+KL K FEG RNNPFQFKH++LCHS+ +V K+P+ PKVVLAS PD+ESG++R+LF+QW +N  NSIILT+R+   TLA +L+     G+ IEL++++RV+L G ELEEY + + E    LIVK  +    +  +   D EM +   KHDI+++         P+G+  +GFF+S K    +FP HEEKV + D+YGEI+  +D+  I+ +T    V   +QN E +K +   I +E                 PTK ++ + +  + AQ+Q IDFEGRS+G+S+LK+L Q++PR+VIV+ GT E    +   CEQ           V++P+ GE++DVT+E  IYQVRLTE LV  L++  GKD +++AWVDG + +     ++   +  E+D + +E           D  P+     H +  +NELKLSDFK  L +N I+SEF GGVL+C +G +ALRR D+G++ +EG L  EYY++RELLYEQYAIV
Sbjct:    1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLLSHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQGEKLNPLIVKPDVEEESS-SESEDDIEMSVITGKHDIVVR---------PEGRHHSGFFKSNKRHHVMFPYHEEKV-KCDEYGEIINLDDY-RIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDNDVQLLEKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQ------NVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKD-AEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPI-----HNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAGKVAMEGCLSEEYYKIRELLYEQYAIV 756          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: EFX73157.1 (hypothetical protein DAPPUDRAFT_58164 [Daphnia pulex])

HSP 1 Score: 801.586 bits (2069), Expect = 0.000e+0
Identity = 405/752 (53.86%), Positives = 543/752 (72.21%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEED-DAFEEPSLKKPRIPQLDS-APLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK C LSG   + PH YLL+VDD+ FLLD GWD   +     E+KK  +KIDAVLLSYPD  HLGALPYAVGKLGL+C ++ATVPV+KMGQMFMYD Y+++   EDFDLFTLD+VD++FDK+ QLKY+Q+  LKGKG+G+ ITP+PAGHM+GGT+WKIVKDGEEDIIYAVD+NHKKERHLNGC+L+K+ RPS+LITDA+N    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM++QLW+N++SGL AYSLAL+NN +Y+V EFAKSQIEWMS+KLMK+FEG RNNPF FK+L+LCH++ EV ++   KVVL+S PD+E G++R+LF  WC++ +NSIILTSRS   TL   L  + + +++ LE+K+RV+L G ELEE+ +   E  +   +        ESS+ E E+   +HDI++++D      V      FF+S K    +FP  E+K I++D+YGEI+R ED++ I++S ++     + E        ++E PTKC+++  +  I A I  IDFEGRS+G+SI+KL++ +KP++ IVVRG+ E C  L+N C       ++     +    GE +D T ES IYQVRL +SL+ SL +   KD +++AW+D  +    +     + D  D D  E  SL+K + P L+   P D+   H+ +++NELKLSDFK +L +NGISSEF GGVL+C +G VALRR++SGR+T+EG +  +YYRVRELLYEQYAI+
Sbjct:    1 MTSIIKFCALSGALDDSPHSYLLKVDDFTFLLDCGWDEKCSEGFIHELKKHVNKIDAVLLSYPDQLHLGALPYAVGKLGLTCPVYATVPVYKMGQMFMYDWYQSKDNMEDFDLFTLDDVDNSFDKVVQLKYSQSVPLKGKGQGLIITPLPAGHMLGGTVWKIVKDGEEDIIYAVDYNHKKERHLNGCELEKIQRPSLLITDAYNTLYAQPRRRSRDEKLMTNILQTLRGGGNVLVAVDTAGRVLELAHMLEQLWRNQESGLRAYSLALLNNVAYNVNEFAKSQIEWMSDKLMKSFEGARNNPFGFKYLQLCHTLPEVLRIAGSKVVLSSCPDLECGFARDLFALWCSDARNSIILTSRSGQGTLGQRLHDQRNLKSVTLELKQRVKLEGAELEEFRRKEREKNILSGIKIKDQTAAESSESEDEVKKGRHDIVVRSDDKTTGAVQH----FFKSSKKHPTMFPYFEDK-IKFDEYGEIIRPEDYV-IAESEDHEMADYSVEKPKWEEEPEAECPTKCISTTTTLAINASIMHIDFEGRSDGESIIKLIESMKPKRTIVVRGSSESCQALQNLCLSTGSSDNK----AFIARKGETIDATIESHIYQVRLKDSLLSSLSFGKAKD-AEVAWIDARLTYQVN-----LTDLRDLDDKENNSLRKEQAPLLEPLEPKDIP-GHETSYINELKLSDFKQVLVRNGISSEFIGGVLWCCNGNVALRRNESGRVTLEGCISDDYYRVRELLYEQYAII 735          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: gb|KPM11263.1| (cleavage and polyadenylation specificity factor subunit 2-like protein [Sarcoptes scabiei])

HSP 1 Score: 679.093 bits (1751), Expect = 0.000e+0
Identity = 375/833 (45.02%), Positives = 517/833 (62.06%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRT----IELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNG-------------------------------GDESSDDEMEI-SGKKHDIIMKNDPMNANDVPKGKT---GFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQST----ENSNVKQNNEIKDD--------RSN------IQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTP-EKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAF-------------------------EEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSII   P+SG  +  P CYLL++D++ FLLD G D   +      +      IDAVLLS+PD  HLGALPY  GK  LSC ++AT PV++MGQMF YD+Y++ +  E+FDLF LD+VD AF+K+ Q+KYNQT ALK KG+GI++TP+PAGHMIGGTIWKIVKDGEEDI+YA D NHKKERHLNGC LD++ RPS+LI D  NI     RR+ RD ++  +I++TLRN G+VLI TDTAGR+LEL+HM+DQ W++ + GL AYS+ L+NNFSY+V EFAKS +EWMS+KLM+ FEG+RNNPF FKH++LCH++ E+ +V  P VVLAS PD E G++RE+F  +  NPKN IILT RS   +LA D + +   +T    ++LE K R+EL G ELE+Y + + E   +K    +L G                                + +  DE  + S  K   I ++D M ++    G+    GFF++ K  +P++P  E K IRWD+YGEI+  +D+     S     +  N+  N+E   D         SN      ++  VPTKCVTS     ++A+I+FIDFEGRS+G+SI KL++ I+P + I++R    E  ++  N+C         +   ++ P+  E++D TTE  IYQVRL ++LV SL +S+ KDG++LAWV+G I +  + S  I+P  E+ A                          ++    +  IP L          HQ  FVNELKLSDFK IL K+GI +EF GGVL C  G V ++R +SGRI +EG++  +Y++VR+LLYEQYAI+
Sbjct:    1 MTSIINFIPISGSLNNSPPCYLLKIDEFCFLLDCGLDENCDLCYINNLSPYIPNIDAVLLSHPDTFHLGALPYLFGKCSLSCDVYATTPVYQMGQMFAYDLYQSHQNYENFDLFCLDDVDLAFEKVIQVKYNQTIALKDKGQGITLTPLPAGHMIGGTIWKIVKDGEEDIVYASDINHKKERHLNGCALDRISRPSMLIIDCSNINYVPERRKKRDSQLFGSIVETLRNFGSVLIGTDTAGRILELSHMLDQFWRS-EPGLQAYSIVLLNNFSYNVFEFAKSLVEWMSDKLMRGFEGQRNNPFAFKHIQLCHNLIELKRVNEPMVVLASQPDFECGFTREIFFTFAQNPKNRIILTQRSFRGSLA-DCLQQIKSQTRPFVLDLERKSRLELFGEELEQYQEQKREEEARKEQEKLLEGEMKKKIKEEEEESEESDDDEFYFISNEINNERNTNKFDEKNVRSSLKKQSINRHDLMLSSRYNDGRVKGGGFFKNAKKSYPMYPDCERK-IRWDEYGEIIDPDDFSIFDSSRIFMEDKENIVNNDEQMIDVNEQNGKINSNLTKSITVEPAVPTKCVTSIERVSVEARIEFIDFEGRSDGESIKKLIKMIRPHRCILIRTNDLESAESFVNYCRTNDCVTSGK---IFVPKILELIDATTERHIYQVRLKDALVSSLRFSSYKDGAELAWVEGEIEM--NLSESILPQHEEIASTAMATIAENENENAMENVKNIEPKEDQKQSNRSSIPILKQLQFSKITPHQTIFVNELKLSDFKQILMKHGIQAEFSGGVLLC-KGQVEVKRTESGRIQLEGTVSDDYFKVRKLLYEQYAIL 824          
BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Match: gb|EFA06334.1| (Cleavage and polyadenylation specificity factor 73-like Protein [Tribolium castaneum])

HSP 1 Score: 146.362 bits (368), Expect = 5.688e-36
Identity = 107/406 (26.35%), Positives = 190/406 (46.80%), Query Frame = 0
Query:    5 IKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLK-------KEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQ-RKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRD-SGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVEL-TGTELEEYNKHRD 400
            IK+ PL  G+  G  C LL +   N +LD G    +N + +        +   + S ID V++S+  LDH GALPY    +G S  I+ T P   +  + + D+ +    K+ D + FT   +     K+  +  +Q+  +  +   I I    AGH++G  ++ I + G + ++Y  D+N   +RHL    +DK  RP +LI+++      +  +R R+   +  + + +   G VLI     GR  EL  +++  W+  +    + ++L L    +     + K  I W ++K+ KTF   + N F FKH+K        N  P P VV A+   + +G S ++F +W  N  N +I+       T+ + ++     + +E E K+ VE+    E   ++ H D
Sbjct:    4 IKITPLGAGQDVGRSCILLTMGGKNIMLDCGMHMGYNDERRFPDFSYISQEGPLTSYIDCVIISHFHLDHCGALPYMSEMVGYSGPIYMTHPTKAIAPILLEDMRKVSVEKKGDQNFFTSQMIKDCMKKVIAVTLHQSLMVDNE---IEIKAYYAGHVLGAAMFWI-RVGAQSVVYTGDYNMTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECMDRGGKVLIPVFALGRAQELCILLETYWERMNLKAPVYFALGLTEKAN----NYYKMFITWTNQKIRKTF--VQRNMFDFKHIKPFDRSYIDN--PGPMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGFCVQGTVGHKILN--GAKRVEFENKQIVEVKMSVEYMSFSAHAD 394          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|936676729|ref|XP_014237486.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trichogramma pretiosum] >gi|936676731|ref|XP_014237494.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trichogramma pretiosum] >gi|936676733|ref|XP_014237503.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trichogramma pretiosum])

HSP 1 Score: 882.093 bits (2278), Expect = 0.000e+0
Identity = 438/757 (57.86%), Positives = 563/757 (74.37%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK-KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDD----RSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPDEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L+VD+   LLD GWD  F+ ++ KE+K+    IDAVLLSYPD  HLGALPY VGK GLSC I+AT+PV+KMGQMFMYDVY+++   EDF LFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITD+FN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLMK+FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC NP+NSII+TSRS   TLA DL+  G  R + +EIKK+V+L G ELEEY K +++L +++     +   D S  S+DE+++ GK KHD+++K +          K GFF+  K  +P+FP  EEK I++DDYGEI++ ED+       E  + K+N E K +         S+VPTKCVT+  S  + A + +IDFEGRS+G+S+ K+L Q++PR+VI+VRG+ +  D       ++A K       V+ P  GE +DVTTE+ IYQVRLT++LV SL++S GK  ++LAWVD  I   T    D++PD E+   E+P  +   I  L+  PL     H+  F+NELKLSDFK IL+K+ ++SEF GGVL+C +  +A+RRH++G+I +EG L  EYY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGALDESPPCYILQVDELRILLDCGWDEKFDQEMIKELKRHVHTIDAVLLSYPDPLHLGALPYLVGKCGLSCPIYATIPVYKMGQMFMYDVYQSRHNTEDFTLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDSFNATYQQARRRARDEKLMTNILQTLRGGGNVLVGVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLQLCHSMTELNQVPSPKVVLASTPDMECGFSRELFLQWCGNPQNSIIITSRSSPGTLARDLIENGGNRNLTIEIKKKVKLEGLELEEYQK-KEKLRLEQQKQEKMEIDDMSSESEDEIDVGGKGKHDLLVKQE---------HKPGFFKQNKKLYPMFPFVEEK-IKFDDYGEIIKPEDYKVADAPAEGEDNKENFESKTEDQFHHPENASDVPTKCVTTSRSIAVNASVTYIDFEGRSDGESLQKILIQLRPRRVILVRGSQKDSD-------KMAQKAQLAGARVFIPTKGETMDVTTETHIYQVRLTDALVSSLKFSRGKSDTELAWVDAAITARTKVRRDVVPDTEN---EDPVDESENILTLEPLPLSEIPGHETAFINELKLSDFKQILSKSNMNSEFSGGVLWCCNNTIAVRRHEAGKIIMEGCLSEEYYKVRELLYEQYAIV 736          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|1070599635|ref|XP_018397152.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Cyphomyrmex costatus] >gi|1009377879|gb|KYN01143.1| putative cleavage and polyadenylation specificity factor subunit 2 [Cyphomyrmex costatus])

HSP 1 Score: 881.708 bits (2277), Expect = 0.000e+0
Identity = 436/759 (57.44%), Positives = 559/759 (73.65%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEI---SGK-KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC+NP+NSIILTSR+   TLA DL+ KG  R I LE+K+RV+L G ELEEY K R++L  ++     +   D SS+ E EI   SG+ KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K D SN   EV    PTKCV    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK IL K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQK-REKLKQEQLKQEQMETADVSSESEDEIEVGSGRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEVEDNKENTEMKQDESNYHPEVTVDIPTKCVQVSRTMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSPQDTEILAQQAQSTGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQICRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|1069672274|ref|XP_018317813.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex zeteki] >gi|1012984858|gb|KYQ58464.1| putative cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex zeteki])

HSP 1 Score: 879.782 bits (2272), Expect = 0.000e+0
Identity = 434/759 (57.18%), Positives = 558/759 (73.52%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK--KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC+NP+NSIILTSR+   TLA DL+ KG  R I LE+K+RV+L G ELEEY K R++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K D SN   EV    PTKCV    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK +L K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQK-REKLKQEQLKQEQMETADVSSESEDEIEVGGSRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEVEDNKENVEMKQDESNYHPEVAVDIPTKCVQVSRTMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGLPKDTEILAQQAQSTGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQVCRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|1070209929|ref|XP_018374083.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex cornetzi] >gi|1070209931|ref|XP_018374084.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex cornetzi] >gi|1009389621|gb|KYN11412.1| putative cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex cornetzi])

HSP 1 Score: 879.011 bits (2270), Expect = 0.000e+0
Identity = 434/759 (57.18%), Positives = 558/759 (73.52%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK--KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC+NP+NSIILTSR+   TLA DL+ KG  R I LE+K+RV+L G ELEEY K +++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K D SN   EV    PTKCV       + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK +L K+ ISSEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGMELEEYQK-KEKLKQEQLKQEQMETADVSSESEDEIEVGGSRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEVEDNKENVEMKQDESNYHPEVAVDIPTKCVQVSRMMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGLPKDTEILAQQAQSTGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQVCRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNISSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|826410064|ref|XP_012536785.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Monomorium pharaonis])

HSP 1 Score: 878.241 bits (2268), Expect = 0.000e+0
Identity = 434/759 (57.18%), Positives = 558/759 (73.52%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISG--KKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWCTNP+NSIILTSR+   TL  DL+ KG  R I L++K+RV+L G ELEEY K R++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K + +N   EV    PTKCV    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK IL K+ I SEF GGVL+C +  +A+RRH++G++ +EG L  EYY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCTNPQNSIILTSRTSPGTLGRDLVEKGGNRNITLDVKRRVKLEGIELEEYQK-REKLKQEQLKQEQMETADVSSESEDEIEVGGGRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEVEDNKENIEMKQEETNYHPEVAMDIPTKCVQVSRTMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSPKDTEILAQQAQSTGAR-------VFVPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQICRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCLSEEYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|646719772|gb|KDR21766.1| (putative cleavage and polyadenylation specificity factor subunit 2 [Zootermopsis nevadensis])

HSP 1 Score: 877.47 bits (2266), Expect = 0.000e+0
Identity = 436/761 (57.29%), Positives = 568/761 (74.64%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYN-KHRDELIVKKSLTSVLNGGDESSDDEME----ISGKKHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQS--EVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIP----DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L+VDD+  LLD GWD  F+ +  KE+++   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+  +KGKG G++ITP+PAGHMIGGTIWKIVK GEEDI+YAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLRNNGNVL+  DTAGRVLELAHM+DQLW+N+DSGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM+TFEG RNNPFQFKHL+LCHSM E+++VPSPKVVLAS PDME G+SRELF+QW  N  NSII+T+R+   TLA +L+  G    + LEI++RV L G ELEEY  K R+     +     ++  DE S+DEM+    ++  KHD+++K +        K +TGFF+S K ++P+FP  EEKV ++D+YGEI+R ED+  +  + E  + K+N ++K++        EVPTKC++   +  + AQ+Q+IDFEGRS+G+S+ K+L Q++PR++I+VRGTPE    + N C Q +         V++P  GE++D TTE+ IYQVRLT++LV +LE   GK+ ++LAW+D  I ++ D S D  P     ++D   EE  ++  +I  L+  PL+    HQ  F+NELKLSDFK +L KN I SEF GGVL+C +G VA+RRH++GR+ +EG L  +YYRVRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMDESPPCYILQVDDFRILLDCGWDENFDQEFMKELRRFIHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDLYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSVPMKGKGYGLTITPLPAGHMIGGTIWKIVKVGEEDIVYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRNNGNVLVTVDTAGRVLELAHMLDQLWRNKDSGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRTFEGARNNPFQFKHLQLCHSMAELSRVPSPKVVLASTPDMECGFSRELFLQWSLNSHNSIIITNRTSPGTLARELIENGGKLALNLEIRRRVRLEGAELEEYQRKERENKEKNQDKNEDVDMSDE-SEDEMDSLSAVAKGKHDLLVKTE-------TKVQTGFFKSNKKQYPMFPFFEEKV-KFDEYGEIIRPEDYKMVDSNPETEDNKENVDLKEEEVTTIDIMEVPTKCISVMKTVRVMAQVQYIDFEGRSDGESLQKILGQLRPRRLILVRGTPESTHAMLNLCRQWS------GARVFAPSRGEIVDATTETHIYQVRLTDALVSALELKKGKE-AELAWLDAQI-MVRDMSKDAKPVIMGVDDDGKDEEDKMEIDKIYTLEPLPLNQVAGHQTAFINELKLSDFKQVLNKNNIPSEFSGGVLWCCNGTVAVRRHEAGRVILEGCLSDDYYRVRELLYEQYAIV 744          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|1058064601|gb|JAS35962.1| (hypothetical protein g.11272 [Clastoptera arizonana])

HSP 1 Score: 877.085 bits (2265), Expect = 0.000e+0
Identity = 442/761 (58.08%), Positives = 564/761 (74.11%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGD----ESSDDEMEISGKK--HDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIP-----DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  LSG   E P CY+L+VD++  LLD GWD  F+    KE+KK  ++IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   E+FD+FTLD+VD+AFDKI QLKYNQ+  +KGKG GI+ITP+PAGHMIGGT+WKIVK GEEDIIYAVDFNHKKERHLNG +L+KL RPS+LITDAFN    Q  RR RDEK+MTNILQTLRNNGNVL+  DTAGRVLELAHM+DQLWQNRDSGLLAYSLAL+NN SY+VVEFAKSQIEWMSEKLM++FEG RNNPFQFKHL+LCHSM+E++KV  PKVVLAS PDME G+SR+LF+QWC+ P NSII+T+RS   TLA +L+  G  RTI+L +KKRV L G ELEEY +   E   +K   +  N  D      S+DE+EI   K  HD+++K      +++    TGFF+  K ++P+FP +EEK +++DDYGEIVR ED+  +  + E  + K+N +IK++     +EVPTKCVT   + ++ AQ+Q+IDFEGRS+G SI K++ Q++PR++I+VRG  E    L+  C+Q           +++P  GE +DVTTES IYQVRLT++LV SLE+  GKD ++LAW+D  I ++ +    + P     D+ ++   E  +    I  LD  P +    H   F+NELKLSDFK +LTKN I SEF GGVL+C +G VA+RRH++G++T+EG L  EYYRVRELLYEQYAIV
Sbjct:    1 MTSIIKVHALSGSMDESPPCYILQVDEFRILLDCGWDEKFDQDFIKELKKHVNQIDAVLLSYPDPLHLGALPYMVGKCGLNCPIYATIPVYKMGQMFMYDLYQSRHDMEEFDMFTLDDVDAAFDKIVQLKYNQSINMKGKGYGITITPLPAGHMIGGTMWKIVKIGEEDIIYAVDFNHKKERHLNGYELEKLQRPSLLITDAFNATYVQAGRRNRDEKLMTNILQTLRNNGNVLVTVDTAGRVLELAHMLDQLWQNRDSGLLAYSLALLNNVSYNVVEFAKSQIEWMSEKLMRSFEGARNNPFQFKHLQLCHSMSELSKVSGPKVVLASTPDMECGFSRDLFLQWCSKPNNSIIITNRSSPGTLARELVDGGGNRTIQLLVKKRVRLEGAELEEYQRREKE---QKENRNDKNDNDVYLSSESEDELEIMNIKGRHDLLIK------SEIKGSTTGFFKVNKKQYPMFPFYEEK-LKFDDYGEIVRPEDFKMVDSTIEIEDNKENIDIKEEDIGDITEVPTKCVTFAKAINVVAQVQYIDFEGRSDGQSIQKIITQLRPRRLIIVRGNVESTMALQAHCKQWT------DARIFAPSKGETVDVTTESHIYQVRLTDALVSSLEFKKGKD-AELAWLDSQI-VVRNKGITVKPLQMEIDQPEETQGESDVPDDEIYTLDPLPPNQISGHDPVFINELKLSDFKQVLTKNNIPSEFSGGVLWCCNGTVAVRRHEAGKVTLEGCLSEEYYRVRELLYEQYAIV 743          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|1070184024|ref|XP_018349994.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex septentrionalis] >gi|1009415072|gb|KYN33898.1| putative cleavage and polyadenylation specificity factor subunit 2 [Trachymyrmex septentrionalis])

HSP 1 Score: 877.085 bits (2265), Expect = 0.000e+0
Identity = 433/759 (57.05%), Positives = 558/759 (73.52%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK--KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR+ GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC+NP+NSIILTSR+   TLA DL+ KG  R I LE+K+RV+L G ELEEY K R++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K D  N   EV    PTKCV       + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK +L K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRSGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQK-REKLKQEQLKQEQMETADVSSESEDEIEVGGSRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEIEDNKENVEMKQDEFNYHPEVAMDIPTKCVQVSRMMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSPKDTEILAQQAQSTGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQVCRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|801376998|ref|XP_012064360.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Atta cephalotes] >gi|1068376432|ref|XP_018057949.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Atta colombica] >gi|1068376435|ref|XP_018058020.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Atta colombica] >gi|1068376439|ref|XP_018058097.1| PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Atta colombica])

HSP 1 Score: 876.7 bits (2264), Expect = 0.000e+0
Identity = 433/759 (57.05%), Positives = 557/759 (73.39%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK--KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSNIQSEV----PTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG  +E P CY+L+VD+   LLD GWD  F+    KE+K+   +IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWC+NP+NSIILTSR+   TLA DL+ KG  R I LE+K+RV+L G ELEEY K R++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K + P+FP  EEK I+ D+YGEI++ ED+       E  + K+N E+K D  N   EV    PTKCV       + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D I D E ++A +E      +I  L+  PL+    HQ TF+NELKLSDFK +L K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELLYEQYAIV
Sbjct:    1 MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQK-REKLKQEQLKQEQMETADVSSESEDEIEVGGSRGKHDLLVKQE---------SKPGFFKQSKKQHPMFPFVEEK-IKIDEYGEIIKPEDYKIAETVPEVEDNKENVEMKQDEFNYHPEVAVDIPTKCVQVSRMMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSPKDTEILAQQAQSTGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQVCRDAIADTESENAIDESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLYEQYAIV 737          
BLAST of EMLSAG00000011843 vs. nr
Match: gi|752861633|ref|XP_011258467.1| (PREDICTED: probable cleavage and polyadenylation specificity factor subunit 2 [Camponotus floridanus] >gi|307189918|gb|EFN74154.1| Probable cleavage and polyadenylation specificity factor subunit 2 [Camponotus floridanus])

HSP 1 Score: 876.7 bits (2264), Expect = 0.000e+0
Identity = 428/759 (56.39%), Positives = 562/759 (74.04%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDES--SDDEMEISGK--KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDISQSTENSNVKQNNEIKDDRSN----IQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKGDQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIPD-EEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK+  +SG   E P CY+L+VD+   LLD GWD  F+    KE+K+  ++IDAVLLSYPD  HLGALPY VGK GL+C I+AT+PV+KMGQMFMYD+Y+++   EDFDLFTLD+VD+AFDKI QLKYNQ+ ++KGKG G+++TP+PAGHMIGGTIWKIVK GEEDIIYAVDFNHKKERHLNGC+L++L RPS+LITDAFN    Q RRR RDEK+MTNILQTLR  GNVL+  DTAGRVLELAHM+DQLW+N++SGLLAYSLAL+NN SY+VVEFAKSQIEWMS+KLM++FEG RNNPFQFKHL+LCHSM E+N+VPSPKVVLAS PDME G+SRELF+QWCTNP+NSII+TSR+   TLA DL+ KG  R I L++K+RV+L G ELEEY K R++L  ++     +   D S  S+DE+E+ G   KHD+++K +          K GFF+  K ++P+FP  EEK I+ D+YGEI++ ED+     + E  + K+N E+K + +N    I +++PTKCV    +  + A + +IDFEGRS+G+S+ K+L Q++PR+V++VRG+P+  + L    +    +       V+ P  GE LD TTE+ IYQVRLT++LV  L +S GK  S++AW+D +I        D + D E ++A  E      +I  L+  PL+    HQ TF+NELKLSDFK +L K+ I SEF GGVL+C +  +A+RRH++G++ +EG +  +YY+VRELL+EQYAIV
Sbjct:    1 MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVNQIDAVLLSYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVDAAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKKERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTAGRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMVELNQVPSPKVVLASTPDMECGFSRELFLQWCTNPQNSIIITSRTSPGTLARDLVEKGGNRNITLDVKRRVKLEGIELEEYQK-REKLKQEQMKQEQMETADVSSESEDEIEVGGARGKHDLLVKQE---------SKPGFFKQSKKQYPMFPFVEEK-IKIDEYGEIIKPEDYKIAETAPEVEDNKENVEMKQEETNHHPEIAADIPTKCVQVSRTMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRVVLVRGSPKDTEILAQQAQSAGAR-------VFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQICRDAVADTESENAINESD----KILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRRHEAGKVILEGCISEDYYKVRELLFEQYAIV 737          
BLAST of EMLSAG00000011843 vs. Tigriopus kingsejongensis genes
Match: maker-scaffold281_size224178-snap-gene-1.30 (protein:Tk10995 transcript:maker-scaffold281_size224178-snap-gene-1.30-mRNA-1 annotation:"cleavage and polyadenylation specificity factor subunit 2")

HSP 1 Score: 971.459 bits (2510), Expect = 0.000e+0
Identity = 485/771 (62.91%), Positives = 591/771 (76.65%), Query Frame = 0
Query:    1 MTSIIKMCPLSGGRSEG------PHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVELTGTELEEYNKHRDELIVKKSLTSVLNGGDESSDDEMEISGK----KHDIIMKNDPMNANDVPKGKTGFFRSMKAKFPIFPTHEEKVIRWDDYGEIVRSEDWLDIS----QSTENSNVKQNNEIKDDRSNIQSEVPTKCVTSKHSFHIKAQIQFIDFEGRSEGDSILKLLQQIKPRKVIVVRGTPEKCDTLKNFCEQIALKG------DQQRIDVYSPENGEVLDVTTESFIYQVRLTESLVKSLEYSTGKDGSQLAWVDGVINLLTDDSADIIP-DEEDDAFEEPSLKKPRIPQLDSAPLDLQCNHQATFVNELKLSDFKAILTKNGISSEFQGGVLFCGDGFVALRRHDSGRITIEGSLCSEYYRVRELLYEQYAIV 750
            MTSIIK  PLSGG          PHCYLLEVD + FL D GWD  F+  +  EIK V +KIDAVLLSYPD+ HLG LPY V  LGLSC I+ATVP++KMGQMFMYD+Y+A+   EDF LFTLDEVD  F+ ITQLKYNQT  LKGKGEGI+ITPIPAGHMIGG+IWKIVK+GEEDI+YAVDFNHKKE+HLNG D++K+ RPS+ ITD FN    QLRRR RDEK+MTNILQTLRNNGNVL+C DTAGRVLELAHM+D LWQ +DSGL+AYSLAL+NN S++V+EFAKSQIEWMS+KLM+ FEG+RNNPFQFKHLKLCHSM EVNKVPSPKVVLASM D+E G+SR+LF+ WC  P+N+II+TSR+   TLA+DL+T G+ R + LEIKKR++LTG ELEE+ +   EL  K   + ++   DESSD+EME  G     KHDIIMK   +NA    + KT FF+++K K  +FP  +EK I+WDDYGEIV+ EDW+D +    Q   +  ++ +    D+  +I  EVPTKCV+S     +KAQ+QFIDFEGRS+G+SI+K+LQQ+KPR++I+VRG  ++C+ L N C+Q+  K        QQ+++VY+P+NGE +D TTESFIYQV+L ESLV  L +S GKDG  LAWVDG I+   D+ ADI P D+ D+A EE   KKP IP L   P D    H   FVNELKLSDFK +LTK+GISSEF GGVLFCG+G VALRRHDSGR+TIEG++C +YYRVR+LLYEQYAIV
Sbjct:   18 MTSIIKFRPLSGGVMPADSDACPPHCYLLEVDQFTFLCDCGWDAQFDMNIMNEIKAVINKIDAVLLSYPDIGHLGGLPYVVSTLGLSCPIYATVPIYKMGQMFMYDLYQARYNMEDFKLFTLDEVDKTFEMITQLKYNQTIQLKGKGEGIAITPIPAGHMIGGSIWKIVKEGEEDIVYAVDFNHKKEQHLNGSDIEKIQRPSLFITDGFNASYRQLRRRDRDEKLMTNILQTLRNNGNVLVCVDTAGRVLELAHMIDHLWQIQDSGLIAYSLALLNNMSFNVIEFAKSQIEWMSDKLMRNFEGRRNNPFQFKHLKLCHSMAEVNKVPSPKVVLASMTDLECGFSRDLFLNWCGKPQNNIIITSRTGEGTLAHDLITNGTNRVLNLEIKKRIKLTGAELEEHRRKERELASKSKQSEMMMEDDESSDEEMETGGTKGPIKHDIIMK---VNAPGKTQQKT-FFKAIKTKHMMFPFVDEK-IKWDDYGEIVKPEDWVDGTIDHEQPKADPYLRGHPNDADEGKDI-VEVPTKCVSSVQKIPLKAQVQFIDFEGRSDGESIVKVLQQVKPRRLILVRGREDQCNNLANHCKQLWAKALESSGQSQQKVNVYTPKNGETVDATTESFIYQVKLPESLVGKLLFSKGKDG-LLAWVDGRISFALDEVADIQPRDDGDEAMEEAPKKKPAIPTLLPLPDDEVHAHPTVFVNELKLSDFKIVLTKSGISSEFAGGVLFCGNGNVALRRHDSGRVTIEGTVCDDYYRVRDLLYEQYAIV 781          
BLAST of EMLSAG00000011843 vs. Tigriopus kingsejongensis genes
Match: snap_masked-scaffold495_size155559-processed-gene-0.4 (protein:Tk09235 transcript:snap_masked-scaffold495_size155559-processed-gene-0.4-mRNA-1 annotation:"cleavage and polyadenylation specificity factor subunit 3")

HSP 1 Score: 140.969 bits (354), Expect = 4.995e-35
Identity = 101/366 (27.60%), Positives = 176/366 (48.09%), Query Frame = 0
Query:    9 PLSGGRSEGPHCYLLEVDDYNFLLDVGWDPFFNSKLKKEIKKV--ASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYEAQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCH-QLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVEFAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTK 371
            PL  G+  G  C++LE  D   LLD G  P  N         +  A KID +L+S+  LDH GALP+ + K       F T     + +  + D  +      +  L++  +++++ +KI  L +++    + +  GI      AGH++G  ++ I   G   ++Y  DF+ +++RHL   +L  L RP +LI +A   G H   +R  R+ +  + I + +   G  LI     GR  EL  ++D+ W      L    +   ++ +   +   ++ +  M++K+ +      +NPF FKH+     ++  + +  P V+LAS   M+SG SRELF  WCT+ KN  I+       TLA  ++++
Sbjct:   28 PLGAGQEVGRSCHILEFKDKRVLLDCGIHPGLNGMDALPFVDMIEADKIDLLLISHFHLDHAGALPWFLQKTTFKGKCFMTHATKAIYRWLLSDFIKVSNIATEQMLYSEQDLETSMEKIDTLNFHE----EKEVNGIKFWAYNAGHVLGAAMFMIEIAGVR-VLYTGDFSREEDRHLMAAELPTL-RPDVLIVEA-TYGTHIHEKREDREHRFTSTIHEIVNRGGRCLIPVFALGRAQELLLILDEYWAAHPE-LHEIPIYYASSLAKKCMAVYQTFVNAMNDKIRRQI--AVSNPFVFKHISNLKGIDHFDDI-GPCVILASPGMMQSGLSRELFESWCTDSKNGCIVAGYCVEGTLAKHILSE 382          
BLAST of EMLSAG00000011843 vs. Tigriopus kingsejongensis genes
Match: maker-scaffold719_size106944-snap-gene-0.30 (protein:Tk04736 transcript:maker-scaffold719_size106944-snap-gene-0.30-mRNA-1 annotation:"integrator complex subunit 11")

HSP 1 Score: 140.198 bits (352), Expect = 5.506e-35
Identity = 110/423 (26.00%), Positives = 188/423 (44.44%), Query Frame = 0
Query:    5 IKMCPLSGGRSEGPHCYLLEVDDYNFLLDVGWD---------PFF------------NSKLKKEIKKVASKIDAVLLSYPDLDHLGALPYAVGKLGLSCSIFATVPVHKMGQMFMYDVYE-AQRKQEDFDLFTLDEVDSAFDKITQLKYNQTFALKGKGEGISITPIPAGHMIGGTIWKIVKDGEEDIIYAVDFNHKKERHLNGCDLDKLLRPSILITDAFNIGCHQLRRRVRDEKIMTNILQTLRNNGNVLICTDTAGRVLELAHMVDQLWQNRDSGLLAYSLALVNNFSYSVVE----FAKSQIEWMSEKLMKTFEGKRNNPFQFKHLKLCHSMNEVNKVPSPKVVLASMPDMESGYSRELFIQWCTNPKNSIILTSRSPANTLAYDLMTKGSGRTIELEIKKRVEL-TGTELEEYNKHRD 400
            I + PL  G+  G  C L+ +   + +LD G           P F            N  + K    +   IDAV++S+  LDH GALP+    +G +  I+ T P   +  + + D+   A  ++ + + FT   V     K+  +  +Q   +   GE I +    AGH++G  +++ V+ G + I+Y  D+N   +RHL    +DK  RP +LI+++      +  +R R+   +  +   +   G VLI     GR  EL  +++  W+  +  +  Y       FS  + E    + K  I W +EK+ KTF     N F+FKH+         N  P P VV A+   + +G S  +F +WC+   N II+     A T+ + ++     R +E +  K VE+    +   ++ H D
Sbjct:    4 ITVTPLGAGQDVGRSCLLVRIGGKHIMLDCGMHMGYNDDRRFPDFSYITGTTLEDVTNVGVAKTNGILTEHIDAVIISHFHLDHCGALPFMTEMVGYNGPIYMTHPTKAIAPILLEDMRRVAVERKGETNFFTSAMVKDCMKKVIAVNLHQIVKV---GEDIELKAYYAGHVLGAAMFQ-VRVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHDCVEKGGKVLIPVFALGRAQELCILLETYWERMNLKVPIY-------FSMGLTEKANNYYKMFITWTNEKIRKTF--VERNMFEFKHITGFDRAYIHN--PGPMVVFATPGMLHAGLSLHIFEEWCSGELNMIIMPGYCVAGTVGHKILN--GARKLEFKKGKPVEVKMSVQYMSFSAHAD 408          
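The Identity and Positives percentages reported for each HSP above are the match counts divided by the alignment length. A minimal Python sketch for re-deriving them from a summary line follows; `parse_hsp_stats` and `HSP_STATS` are hypothetical names introduced here for illustration and are not part of BLAST or of this database.

```python
import re

# Matches the per-HSP summary line. "Posi?tives" also accepts the
# misspelling "Postives" that some copies of such pages carry.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line: str) -> dict:
    """Recompute identity/positives percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

# Summary line of the integrator complex subunit 11 hit shown above.
stats = parse_hsp_stats(
    "Identity = 110/423 (26.00%), Positives = 188/423 (44.44%), Query Frame = 0"
)
```

Comparing the recomputed values with the reported ones is a cheap way to catch digits damaged during copying.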
The following BLAST results are available for this feature:
BLAST of EMLSAG00000011843 vs. GO
Analysis Date: 2014-04-02 (Blast vs. GO)
Total hits: 25
Match Name  E-value   Identity  Description
-           0.000e+0  53.21     symbol:Cpsf100 "Cleavage and polyadenylation speci...
-           0.000e+0  51.00     symbol:cpsf2 "cleavage and polyadenylation specifi...
-           0.000e+0  50.50     symbol:Cpsf2 "cleavage and polyadenylation specifi...
-           0.000e+0  50.38     symbol:Cpsf2 "cleavage and polyadenylation specifi...
-           0.000e+0  50.63     symbol:CPSF2 "Cleavage and polyadenylation specifi...
-           0.000e+0  50.50     symbol:CPSF2 "Uncharacterized protein" species:961...
-           0.000e+0  50.25     symbol:CPSF2 "Cleavage and polyadenylation specifi...
-           0.000e+0  50.25     symbol:cpsf2 "Cleavage and polyadenylation specifi...
-           0.000e+0  49.50     symbol:CPSF2 "Uncharacterized protein" species:903...
-           0.000e+0  40.54     symbol:cpsf-2 species:6239 "Caenorhabditis elegans...

BLAST of EMLSAG00000011843 vs. C. finmarchicus
Analysis Date: 2014-05-09 (TblastN vs C. finmarchicus TSA)
Total hits: 10
Match Name                       E-value    Identity  Description
gi|592908501|gb|GAXK01049874.1|  0.000e+0   59.32     TSA: Calanus finmarchicus comp31353_c1_seq1 transc...
gi|592769975|gb|GAXK01184593.1|  1.627e-35  27.03     TSA: Calanus finmarchicus comp135388_c0_seq1 trans...
gi|592906641|gb|GAXK01051734.1|  4.987e-32  26.61     TSA: Calanus finmarchicus comp29218_c1_seq1 transc...
gi|592801774|gb|GAXK01152794.1|  1.778e-19  48.68     TSA: Calanus finmarchicus comp402423_c0_seq1 trans...
gi|592801773|gb|GAXK01152795.1|  3.962e-15  42.65     TSA: Calanus finmarchicus comp402423_c0_seq2 trans...
gi|592758233|gb|GAXK01196180.1|  1.051e-7   48.44     TSA: Calanus finmarchicus comp752719_c0_seq1 trans...
gi|592953186|gb|GAXK01005367.1|  1.892e-2   42.31     TSA: Calanus finmarchicus comp5794816_c0_seq1 tran...
gi|592850766|gb|GAXK01106778.1|  1.050e-1   53.85     TSA: Calanus finmarchicus comp36201_c0_seq2 transc...
gi|592867026|gb|GAXK01090536.1|  7.181e-1   30.00     TSA: Calanus finmarchicus comp2518352_c0_seq1 tran...
gi|592830207|gb|GAXK01127337.1|  1.209e+0   46.67     TSA: Calanus finmarchicus comp916720_c0_seq1 trans...
BLAST of EMLSAG00000011843 vs. L. salmonis peptides
Analysis Date: 2014-05-10 (Blastp vs. self)
Total hits: 3
Match Name         E-value    Identity  Description
EMLSAP00000011843  0.000e+0   100.00    pep:novel supercontig:LSalAtl2s:LSalAtl2s831:28181...
EMLSAP00000000010  1.045e-38  27.18     pep:novel supercontig:LSalAtl2s:LSalAtl2s1003:5798...
EMLSAP00000007282  1.022e-33  26.78     pep:novel supercontig:LSalAtl2s:LSalAtl2s409:12927...
BLAST of EMLSAG00000011843 vs. SwissProt
Analysis Date: 2017-02-10 (Blastp vs. SwissProt)
Total hits: 25
Match Name                            E-value     Identity  Description
gi|18203548|sp|Q9V3D6.1|CPSF2_DROME   0.000e+0    53.21     RecName: Full=Probable cleavage and polyadenylatio...
gi|18202027|sp|O35218.1|CPSF2_MOUSE   0.000e+0    50.38     RecName: Full=Cleavage and polyadenylation specifi...
gi|51338827|sp|Q9P2I0.2|CPSF2_HUMAN   0.000e+0    50.63     RecName: Full=Cleavage and polyadenylation specifi...
gi|1706103|sp|Q10568.1|CPSF2_BOVIN    0.000e+0    50.25     RecName: Full=Cleavage and polyadenylation specifi...
gi|18203567|sp|Q9W799.1|CPSF2_XENLA   0.000e+0    50.25     RecName: Full=Cleavage and polyadenylation specifi...
gi|18201967|sp|O17403.1|CPSF2_CAEEL   0.000e+0    40.54     RecName: Full=Probable cleavage and polyadenylatio...
gi|229553940|sp|A8XUS3.2|CPSF2_CAEBR  6.439e-180  38.34     RecName: Full=Probable cleavage and polyadenylatio...
gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH   1.215e-166  35.99     RecName: Full=Cleavage and polyadenylation specifi...
gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ   1.816e-165  35.64     RecName: Full=Cleavage and polyadenylation specifi...
gi|74858209|sp|Q55BS1.1|CPSF2_DICDI   1.188e-113  45.67     RecName: Full=Cleavage and polyadenylation specifi...

BLAST of EMLSAG00000011843 vs. Select Arthropod Genomes
Analysis Date: 2017-02-20 (Blastp vs. Selected Arthropods)
Total hits: 25
Match Name       E-value    Identity  Description
XP_006561140.1   0.000e+0   55.99     PREDICTED: probable cleavage and polyadenylation s...
XP_006561139.1   0.000e+0   55.99     PREDICTED: probable cleavage and polyadenylation s...
gb|KFM58192.1|   0.000e+0   55.57     Cleavage and polyadenylation specificity factor su...
gb|EFA07272.1|   0.000e+0   55.45     putative cleavage and polyadenylation specificity ...
EEB18592.1       0.000e+0   56.16     Cleavage and polyadenylation specificity factor 10...
EAA08192.4       0.000e+0   56.92     AGAP002474-PA [Anopheles gambiae str. PEST]
AAF56844.1       0.000e+0   53.21     cleavage and polyadenylation specificity factor 10...
EFX73157.1       0.000e+0   53.86     hypothetical protein DAPPUDRAFT_58164 [Daphnia pul...
gb|KPM11263.1|   0.000e+0   45.02     cleavage and polyadenylation specificity factor su...
gb|EFA06334.1|   5.688e-36  26.35     Cleavage and polyadenylation specificity factor 73...

BLAST of EMLSAG00000011843 vs. nr
Analysis Date: 2017-02-20 (Blastp vs. NR (2/2017))
Total hits: 25
Match Name                         E-value   Identity  Description
gi|936676729|ref|XP_014237486.1|   0.000e+0  57.86     PREDICTED: probable cleavage and polyadenylation s...
gi|1070599635|ref|XP_018397152.1|  0.000e+0  57.44     PREDICTED: probable cleavage and polyadenylation s...
gi|1069672274|ref|XP_018317813.1|  0.000e+0  57.18     PREDICTED: probable cleavage and polyadenylation s...
gi|1070209929|ref|XP_018374083.1|  0.000e+0  57.18     PREDICTED: probable cleavage and polyadenylation s...
gi|826410064|ref|XP_012536785.1|   0.000e+0  57.18     PREDICTED: probable cleavage and polyadenylation s...
gi|646719772|gb|KDR21766.1|        0.000e+0  57.29     putative cleavage and polyadenylation specificity ...
gi|1058064601|gb|JAS35962.1|       0.000e+0  58.08     hypothetical protein g.11272 [Clastoptera arizonan...
gi|1070184024|ref|XP_018349994.1|  0.000e+0  57.05     PREDICTED: probable cleavage and polyadenylation s...
gi|801376998|ref|XP_012064360.1|   0.000e+0  57.05     PREDICTED: probable cleavage and polyadenylation s...
gi|752861633|ref|XP_011258467.1|   0.000e+0  56.39     PREDICTED: probable cleavage and polyadenylation s...

BLAST of EMLSAG00000011843 vs. Tigriopus kingsejongensis genes
Analysis Date: 2018-04-18 (Blastp vs. Tigriopus kingsejongensis proteins)
Total hits: 3
Match Name                                            E-value    Identity  Description
maker-scaffold281_size224178-snap-gene-1.30           0.000e+0   62.91     protein:Tk10995 transcript:maker-scaffold281_size2...
snap_masked-scaffold495_size155559-processed-gene-0.4  4.995e-35  27.60     protein:Tk09235 transcript:snap_masked-scaffold495...
maker-scaffold719_size106944-snap-gene-0.30           5.506e-35  26.00     protein:Tk04736 transcript:maker-scaffold719_size1...
Alignments
The following features are aligned
Aligned Feature  Feature Type  Alignment Location
LSalAtl2s831     supercontig   LSalAtl2s831:281816..286361 +
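As a consistency check, the alignment location above can be converted to a feature length. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the 4546 bp length given in the FASTA header of this record). `parse_location` and `feature_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(location: str):
    """Split 'contig:start..end strand' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)\s*([+-])", location.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def feature_length(start: int, end: int) -> int:
    # Assumes 1-based, inclusive coordinates.
    return end - start + 1

contig, start, end, strand = parse_location("LSalAtl2s831:281816..286361 +")
length = feature_length(start, end)
```

The same helper accepts the compact form without a space before the strand sign, as used in the Sequences section.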
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                  Date Performed
ensembl                                        2013-09-26 .965016
Blast vs. GO                                   2014-04-02
TblastN vs C. finmarchicus TSA                 2014-05-09
Blastp vs. self                                2014-05-10
Blastp vs. SwissProt                           2017-02-10
Blastp vs. Selected Arthropods                 2017-02-20
Blastp vs. NR (2/2017)                         2017-02-20
Blastp vs. Tigriopus kingsejongensis proteins  2018-04-18
Properties
Property Name  Value
Logic name     ensemblgenomes
Description    snap_masked-LSalAtl2s831-processed-gene-2.5
Biotype        protein_coding
Evidence       IEA
Note           Probable cleavage and polyadenylation specificity factor subunit 2
Cross References
External references for this gene
Database                Accession
Ensembl Metazoa (gene)  EMLSAG00000011843 (primary cross-reference)
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name               Species                  Type
EMLSAT00000011843  EMLSAT00000011843-707690  Lepeophtheirus salmonis  mRNA


Sequences
The following sequences are available for this feature:

gene from alignment at LSalAtl2s831:281816..286361+

>EMLSAG00000011843-694609 ID=EMLSAG00000011843-694609|Name=EMLSAG00000011843|organism=Lepeophtheirus salmonis|type=gene|length=4546bp|location=Sequence derived from alignment at LSalAtl2s831:281816..286361+ (Lepeophtheirus salmonis)
CTGCCATGACTTCCATCATCAAAATGTGCCCCTTAAGCGGGGGCCGCTCT GAGGGACCTCATTGCTATCTCCTGGAAGTGGATGATTACAACTTCCTCCT GGATGTGGGATGGGACCCATTCTTCAACTCCAAACTCAAAAAGGAGATCA AGAAAGTCGCTTCCAAAATAGACGCCGTTCTTCTTTCCTACCCAGATCTA GGCAATGCACTCTTAATTACGAATATATGCCGCTCTTATTAATAAACTAT GTTTTCATCTCACAGATCATCTGGGAGCGCTGCCCTACGCCGTTGGTAAA CTGGGACTCTCCTGCTCCATATTTGCAACTGTTCCCGTCCATAAAATGGG TCAAATGTTCATGTATGACGTCTACGAGGCGCAACGGAAGCAAGAGGACT TTGATCTCTTTACTCTGGATGAAGTGGACTCTGCTTTTGATAAAATAACG CAGCTCAAGTATAATCAAACCTTTGCTCTTAAAGGTGATGATGATTTACT TAAATCAAGCAATATACGGCCCAATATTTCATAGACGTATTTTCGAGTTA ATTATTAGGATGAGTGGCTATAGTTTAGAGCATATATTTGATTTAAGGAC GGATGTTGGCTCCTTTTAACCTTGAAATCTCTTTATATATCATTTTTTTT ACTGTGACAATCCATTTTTACTACTTTAAATCCAATTTGCGGCCGTTTAT AAAGGTTAATAAGAATAATTCTCAATTGATAGAAATTGAGGATTACGTCA TAATTGAAGAAAACCCAGTTTTGAAGGCTCTATATTTTCCTATAAGTATT GATCAAACATCATCAAATGGCTTTAAGGATTAAAAAAGACTTAGCAACTA ATACTAATTGATGCCTATGATAAATAGTGATATTTGAATATAAGCAAAGT ACATAATATGTACATAAAATCCCTATGAAATGTAAAAAATTGTACTTTTA AAAAAAAATCAATTAGAGACGATAAAAGAAGGTTAATTAGAGAGAAATAC CTTATTATTTCCATTTCTACTATATGGAAAAAGATTATATAATCTAATAA TAATGTCGATAATTACTCGTCCAAGATTGCAAATGTAATCATTCTAGCAC GCTTTTGTGTTGTTGACGGTCCACAATTCTTCTGATTTTAATTTTGTTCA ACTTAATTTTTAGGTAAGGGAGAAGGGATTTCTATCACACCCATCCCTGC TGGCCATATGATCGGTGGTACCATATGGAAGATCGTCAAAGACGGTGAAG AAGATATTATCTATGCTGTGGACTTTAACCACAAAAAAGAGAGACATCTC AATGGCTGTGACTTAGACAAACTACTTAGACCCTCAATACTCATAACGGA TGCCTTTAATATCGGTTGTCATCAACTCCGACGACGAGTTCGTGATGAAA AAATAATGACAAACATTTTACAAACTTTGCGGAACAACGGGAATGTTCTT ATTTGTACAGATACGGCAGGAAGAGTCTTGGAGCTGGCTCATATGGTAGA TCAATTGTGGCAAAATAGAGACTCTGGTTTATTAGCTTATTCCCTTGCAC TCGTCAATAACTTCAGTTATAGTGTTGTCGAATTTGCAAAATCTCAGATC GAATGGATGTCTGAGAAATTAATGAAGACCTTTGAGGGGAAGAGAAACAA TCCCTTCCAATTCAAACACTTAAAGCTTTGTCATTCCATGAATGAGGTTA ATAAAGTACCATCCCCGAAAGTAGTGCTTGCGAGCATGCCAGACATGGAG TCGGGTTATTCTCGAGAACTATTTATACAGTGGTGCACAAATCCCAAGAA CTCAATTATTCTTACATCCCGATCACCAGCAAATACGTTAGCCTATGATC TTATGACCAAAGGAAGTGGACGGACCATAGAGTTAGAAATTAAAAAGCGT GTCGAGTTAACGGGAACAGAGTTAGAGGAGTATAACAAGCATCGTGACGA 
ACTCATTGTTAAAAAGTCATTAACTTCTGTGTTAAATGGTGGTGATGAGA GTTCAGATGATGAAATGGAAATCTCCGGGAAAAAGCATGATATCATAATG AAAAATGATCCAATGAATGCTAATGATGTCCCCAAAGGGAAAACAGGGTT TTTTAGGTCTATGAAGGCTAAATTCCCTATTTTCCCTACTCAYGAAGAAA AGGTAGTAATTAATTATCTCCTTGGTGATAATCTAACTGTAATTCCATTA TTTTTTTATAGATTCGATGGGATGACTATGGTGAAATTGTGAGGTCTGAG GATTGGTTAGATATTTCTCAAAGCACGGAAAATTCTAATGTTAAGCAAAA TAATGAGATCAAAGATGATAGATCCAATATCCAAAGTGAGGTTCCCACTA AATGTGTAACGTCAAAGCATTCCTTTCATATTAAAGCTCAAATTCAATTC ATTGACTTTGAGGGACGATCTGAGGGAGATTCTATCTTGAAACTTTTGCA ACAAGTAAGCCCTGTCATGAATGTATTATTACATATATATTTTAATATTT TTAGATCAAACCTCGGAAAGTCATCGTTGTTCGTGGTACGCCGGAAAAGT GTGACACTCTAAAAAACTTTTGTGAGCAAATTGCCTTAAAGGGTGACCAA CAAAGAATCGATGTCTATTCACCTGAGAACGGGGAAGTTTTAGACGTCAC GACAGAGTCCTTTATTTATCAAGTGCGTTTAACGGAGTCTCTCGTTAAAA GCCTGGAATACAGTACTGGTAAAGATGGGAGTCAACTGGCCTGGGTGGAC GGAGTTATTAACCTCTTAACAGATGATTCAGCAGATATCATTCCGGATGA AGAGGATGATGCCTTCGAAGAGCCATCTCTTAAAAAACCTCGTATTCCAC AATTAGACTCAGCTCCTTTGGATCTTCAATGTAATCATCAGGCAACGTTT GTAAACGAACTTAAGCTATCAGACTTTAAGGCCATTTTAACCAAAAATGG AATCTCATCAGAGTTTCAAGGAGGCGTTTTATTTTGTGGAGATGGTTTTG TGGCGTTGAGACGACATGATTCTGGCAGAATAACAATTGAGGGTTCGTTA TGTTCAGAATATTATCGTGTTCGAGAATTACTCTATGAGCAATATGCAAT TGTTTGAGCGTGTATGAAAATATAAAGAGCGTTTTATTGTTTTTATTCAA TGCAATTGGTCTTATTATTTAGAGCACGGATTTCATATCAATTATGTTTT AGGGAATATTCTGAATCCGTTCTACGTGTTCGTTGAAGTAAATTTAATAT ACAATGTGATACATATATCTAAAATTAATTATTCAAGTTGATACGATATT TATCCTATAAAAAAGTAATTATTGGGGTTTTATTTTGACAATATTATACA AATTAAATTAATCAAGGATTTCAATTTTAATGTTACTATTCACGTTGAGA GATTTGAAATGTATAGTGTAGGTGTGTTGTATTCGGAGAGTAATTGTAAC GTGCGGCCAAGGATGGGTGTGTACTTCTAATGGATACAATGTGAAGGGTT TATTTTCAGTCTTTTGATAAAAAATCCTGTATTTTTCTATGAATTTGTCT GTTGAAGGTGAGACTGACGTCATGGATAATTAGATTTGTCATAGGACTTG TTCGTTTCCTGCTGATGAGCCATTCCTCAATGGCTGTTTTTTCATATACG AAACCATCCGAGCAACTGACAGGATGGATCATAATTTCATGGGTGATGGG ACAAATTAGATCGGAAGGGATTTCCGTTTCATCATTCACTAAAGAAGGTG GGAAAACTTTGGAATAACTTGGAGCCGTGGGCTCTGAAAGAGGATTTGAT ATTTTGTTTCTCCTCCATTGTATGGTTTGAAAAACGTTTTCTGCAGCAGA GGGTTCGGAAAAAGTCTCTGGGAACTTGGATAATAAATCTTCTCTTGTTA 
AGTGTGCTAATTGAGCTCCGTCAAGTCGTAAACTCTCAAAGGATTCTGAA AATATATGACTGTTAAATCCCTTGATCTTGAGTGAGTTTATCCAAACAGA AACATCTGAGATGGACCATTTTTTCATCTGAAATATGAACAATATTTATT GAATCTATCAATTAGTTTATTTCATTTTTACATCTGTGTGGTGAAATGAT TGGATATTCTCATCTACGAGAGAACCACTTAGTTTCCAAAGTGATATAAA CTCATCATTTGAGCCAGTGGCCAGATACATATATTTAGATCCATCTAATT CAAATAAGGGTTGACTAAAGGCACTGCAAGGAACATAACGACATTGCGTT TCTAGTACACGTAGAACTTGAAAATCAAATCCATCCCATAGACGTACTGT TTTGTCACCAGAAGTGGATACCATTAATTGTCGACTATCTCCCTTAAATA AGAAAACAACAAATTAATAAGATAATATATCATTTGATCAAGTCATTGAG CTTGAGATTACTTTTGAGAAAGAAACAGACATTATAACTCCTCCGTGTCC ACTCAGACTCTTATGAGGCTGGATGTGAGCATTATCCACTTTCCAAATCT TTATCAAGCAATCATTTCCCCCAGTAGCAAGATAGTACTCCTCGCA