some sequence information for gst klh and bsa 4-1-13

2014-08-29

azim58 - some sequence information for gst klh and bsa 4-1-13


potential klh transcript


ATTCTGGTGCGCAAAAACATTCATAGCCTGAGCCATCATGAAGCGGAAGAACTGCGCGATGCGCTGTATAAACTG
CAGAACGATGAAAGCCATGGCGGCTATGAACATATTGCGGGCTTTCATGGCTATCCGAACCTGTGCCCGGAAAAA
GGCGATGAAAAATATCCGTGCTGCGTGCATGGCATGAGCATTTTTCCGCATTGGCATCGCCTGCATACCATTCAG
TTTGAACGCGCGCTGAAAAAACATGGCAGCCATCTGGGCATTCCGTATTGGGATTGGACCCAGACCATTAGCAGC
CTGCCGACCTTTTTTGCGGATAGCGGCAACAACAACCCGTTTTTTAAATATCATATTCGCAGCATTAACCAGGAT
ACCGTGCGCGATGTGAACGAAGCGATTTTTCAGCAGACCAAATTTGGCGAATTTAGCAGCATTTTTTATCTGGCG
CTGCAGGCGCTGGAAGAAGATAACTATTGCGATTTTGAAGTGCAGTATGAAATTCTGCATAACGAAGTGCATGCG
CTGATTGGCGGCGCGGAAAAATATAGCATGAGCACCCTGGAATATAGCGCGTTTGATCCGTATTTTATGATTCAT
CATGCGAGCCTGGATAAAATTTGGATTATTTGGCAGGAACTGCAGAAACGCCGCGTGAAACCGGCGCATGCGGGC
AGCTGCGCGGGCGATATTATGCATGTGCCGCTGCATCCGTTTAACTATGAAAGCGTGAACAACGATGATTTTACC
CGCGAAAACAGCCTGCCGAACGCGGTGGTGGATAGCCATCGCTTTAACTATAAATATGATAACCTGAACCTGCAT
GGCCATAACATTGAAGAACTGGAAGAAGTGCTGCGCAGCCTGCGCCTGAAAAGCCGCGTGTTTGCGGGCTTTGTG
CTGAGCGGCATTCGCACCACCGCGGTGGTGAAAGTGTATATTAAAAGCGGCACCGATAGCGATGATGAATATGCG
GGCAGCTTTGTGATTCTGGGCGGCGCGAAAGAAATGCCGTGGGCGTATGAACGCCTGTATCGCTTTGATATTACC
GAAACCGTGCATAACCTGAACCTGACCGATGATCATGTGAAATTTCGCTTTGATCTGAAAAAATATGATCATACC
GAACTGGATGCGAGCGTGCTGCCGGCGCCGATTATTGTGCGCCGCCCGAACAACGCGGTGTTTGATATTATTGAA
ATTCCGATTGGCAAAGATGTGAACCTGCCGCCGAAAGTGGTGGTGAAACGCGGCACCAAAATTATGTTTATGAGC
GTGGATGAAGCGGTGACCACCCCGATGCTGAACCTGGGCAGCTATACCGCGATGTTTAAATGCAAAGTGCCGCCG
TTTAGCTTTCATGCGTTTGAACTGGGCAAAATGTATAGCGTGGAAAGCGGCGATTATTTTATGACCGCGAGCACC
ACCGAACTGTGCAACGATAACAACCTGCGCATTCATGTGCATGTGGAT


frame 2 klh translation

FWCAKTFIAAAIMKRKNCAMRCINCRTMKAMAAMNILRAFMAIRTCARKKAMKNIRAACMAAAFFRIGIACIPFS
LNARAKNMAAIWAFRIGIGPRPLAACRPFLRIAATTTRFLNIIFAALTRIPCAMATKRFFSRPNLANLAAFFIWR
CRRWKKITIAILKCSMKFCITKCMRALAARKNIAAAPWNIARLIRILAFIMRAWIKFGLFGRNCRNAAANRRMRA
AARAILCMCRCIRLTMKAATTMILPAKTACRTRWWIAIALTINMITATCMAITLKNWKKCCAACAAKAACLRALC
AAAFAPPRWAKCILKAAPIAMMNMRAALAFWAARKKCRGRMNACIALILPKPCITATAPMIMANFALIAKNMIIP
NWMRACCRRRLLCAARTTRCLILLKFRLAKMATCRRKWWANAAPKLCLAAWMKRAPPRCATWAAIPRCLNAKCRR
LAFMRLNWAKCIAWKAAIILAPRAPPNCATITTCAFMCMWI


frame 2 klh translation with stop codons replaced with alanine

FWCAKTFIAAAIMKRKNCAMRCINCRTMKAMAAMNILRAFMAIRTCARKKAMKNIRAACMAAAFFRIGIACIPFS
LNARAKNMAAIWAFRIGIGPRPLAACRPFLRIAATTTRFLNIIFAALTRIPCAMATKRFFSRPNLANLAAFFIWR
CRRWKKITIAILKCSMKFCITKCMRALAARKNIAAAPWNIARLIRILAFIMRAWIKFGLFGRNCRNAAANRRMRA
AARAILCMCRCIRLTMKAATTMILPAKTACRTRWWIAIALTINMITATCMAITLKNWKKCCAACAAKAACLRALC
AAAFAPPRWAKCILKAAPIAMMNMRAALAFWAARKKCRGRMNACIALILPKPCITATAPMIMANFALIAKNMIIP
NWMRACCRRRLLCAARTTRCLILLKFRLAKMATCRRKWWANAAPKLCLAAWMKRAPPRCATWAAIPRCLNAKCRR
LAFMRLNWAKCIAWKAAIILAPRAPPNCATITTCAFMCMWI


frame 3 klh translation

SGAQKHS@PEPS&SGRTARCAV#TAER&KPWRL&TYCGLSWLSEPVPGKRR&KISVLRAWHEHFSALASPAYHSV
&TRAEKTWQPSGHSVLGLDPDH@QPADLFCG@RQQQPVF#ISYSQH#PGYRARCERSDFSADQIWRI@QHFLSGA
AGAGRR#LLRF&SAV&NSA#RSACADWRRGKI@HEHPGI@RV&SVFYDSSCEPG#NLDYLAGTAETPRETGACGQ
LRGRYYACAAASV#L&KREQR&FYPRKQPAERGGG@PSL#L#I&#PEPAWP#H&RTGRSAAQPAPEKPRVCGLCA
ERHSHHRGGESVY#KRHR@R&&ICGQLCDSGRRERNAVGV&TPVSL&YYRNRA#PEPDR&SCEISL&SEKI&SYR
TGCERAAGADYCAPPEQRGV&YY&NSDWQRCEPAAESGGETRHQNYVYERG&SGDHPDAEPGQLYRDV#MQSAAV
@LSCV&TGQNV@RGKRRLFYDREHHRTVQR#QPAHSCACG


frame 3 klh translation with stop codons replaced with alanine

SGAQKHSAPEPSASGRTARCAVATAERAKPWRLATYCGLSWLSEPVPGKRRAKISVLRAWHEHFSALASPAYHSV
ATRAEKTWQPSGHSVLGLDPDHAQPADLFCGARQQQPVFAISYSQHAPGYRARCERSDFSADQIWRIAQHFLSGA
AGAGRRALLRFASAVANSAARSACADWRRGKIAHEHPGIARVASVFYDSSCEPGANLDYLAGTAETPRETGACGQ
LRGRYYACAAASVALAKREQRAFYPRKQPAERGGGAPSLALAIAAPEPAWPAHARTGRSAAQPAPEKPRVCGLCA
ERHSHHRGGESVYAKRHRARAAICGQLCDSGRRERNAVGVATPVSLAYYRNRAAPEPDRASCEISLASEKIASYR
TGCERAAGADYCAPPEQRGVAYYANSDWQRCEPAAESGGETRHQNYVYERGASGDHPDAEPGQLYRDVAMQSAAV
ALSCVATGQNVARGKRRLFYDREHHRTVQRAQPAHSCACG



potential GST transcript

ATGAAACTGTATTATACCCCGGGCAGCTGCAGCCTGAGCCCGCATATTGTGCTGCGCGAAACCGGCCTGGATTTT
AGCATTGAACGCATTGATCTGCGCACCAAAAAAACCGAAAGCGGCAAAGATTTTCTGGCGATTAACCCGAAAGGC
CAGGTGCCGGTGCTGCAGCTGGATAACGGCGATATTCTGACCGAAGGCGTGGCGATTGTGCAGTATCTGGCGGAT
CTGAAACCGGATCGCAACCTGATTGCGCCGCCGAAAGCGCTGGAACGCTATCATCAGATTGAATGGCTGAACTTT
CTGGCGAGCGAAGTGCATAAAGGCTATAGCCCGCTGTTTAGCAGCGATACCCCGGAAAGCTATCTGCCGGTGGTG
AAAAACAAACTGAAAAGCAAATTTGTGTATATTAACGATGTGCTGAGCAAACAGAAATGCGTGTGCGGCGATCAT
TTTACCGTGGCGGATGCGTATCTGTTTACCCTGAGCCAGTGGGCGCCGCATGTGGCGCTGGATCTGACCGATCTG
AGCCATCTGCAGGATTATCTGGCGCGCATTGCGCAGCGCCCGAACGTGCATAGCGCGCTGGTGACCGAAGGCCTG
ATTAAAGAA


GST frame 2

&NCIIPRAAAA&ARILCCAKPAWILALNALICAPKKPKAAKIFWRLTRKARCRCCSWITAIF&PKAWRLCSIWRI
&NRIAT&LRRRKRWNAIIRLNG&TFWRAKCIKAIARCLAAIPRKAICRW&KTN&KANLCILTMC&ANRNACAAII
LPWRMRICLP&ASGRRMWRWI&PI&AICRIIWRALRSARTCIARW&PKA&LKK


GST frame 2 with stop codons replaced with alanine

ANCIIPRAAAAAARILCCAKPAWILALNALICAPKKPKAAKIFWRLTRKARCRCCSWITAIFAPKAWRLCSIWRI
ANRIATALRRRKRWNAIIRLNGATFWRAKCIKAIARCLAAIPRKAICRWAKTNAKANLCILTMCAANRNACAAII
LPWRMRICLPAASGRRMWRWIAPIAAICRIIWRALRSARTCIARWAPKAALKK


GST frame 3

ETVLYPGQLQPEPAYCAARNRPGF@H&TH&SAHQKNRKRQRFSGD#PERPGAGAAAG#RRYSDRRRGDCAVSGGS
ETGSQPDCAAESAGTLSSD&MAELSGERSA#RL@PAV@QRYPGKLSAGGEKQTEKQICVY#RCAEQTEMRVRRSF
YRGGCVSVYPEPVGAACGAGSDRSEPSAGLSGAHCAAPERA@RAGDRRPD#R


GST frame 3 with stop codons replaced with alanines

ETVLYPGQLQPEPAYCAARNRPGFAHATHASAHQKNRKRQRFSGDAPERPGAGAAAGARRYSDRRRGDCAVSGGS
ETGSQPDCAAESAGTLSSDAMAELSGERSAARLAPAVAQRYPGKLSAGGEKQTEKQICVYARCAEQTEMRVRRSF
YRGGCVSVYPEPVGAACGAGSDRSEPSAGLSGAHCAAPERAARAGDRRPDAR



possible bsa transcript

atgaaatgggtgacctttattagcctgctgctgctgtttagcagcgcgtatagccgcggc
gtgtttcgccgcgatacccataaaagcgaaattgcgcatcgctttaaagatctgggcgaa
gaacattttaaaggcctggtgctgattgcgtttagccagtatctgcagcagtgcccgttt
gatgaacatgtgaaactggtgaacgaactgaccgaatttgcgaaaacctgcgtggcggat
gaaagccatgcgggctgcgaaaaaagcctgcataccctgtttggcgatgaactgtgcaaa
gtggcgagcctgcgcgaaacctatggcgatatggcggattgctgcgaaaaacaggaaccg
gaacgcaacgaatgctttctgagccataaagatgatagcccggatctgccgaaactgaaa
ccggatccgaacaccctgtgcgatgaatttaaagcggatgaaaaaaaattttggggcaaa
tatctgtatgaaattgcgcgccgccatccgtatttttatgcgccggaactgctgtattat
gcgaacaaatataacggcgtgtttcaggaatgctgccaggcggaagataaaggcgcgtgc
ctgctgccgaaaattgaaaccatgcgcgaaaaagtgctgaccagcagcgcgcgccagcgc
ctgcgctgcgcgagcattcagaaatttggcgaacgcgcgctgaaagcgtggagcgtggcg
cgcctgagccagaaatttccgaaagcggaatttgtggaagtgaccaaactggtgaccgat
ctgaccaaagtgcataaagaatgctgccatggcgatctgctggaatgcgcggatgatcgc
gcggatctggcgaaatatatttgcgataaccaggataccattagcagcaaactgaaagaa
tgctgcgataaaccgctgctggaaaaaagccattgcattgcggaagtggaaaaagatgcg
attccggaaaacctgccgccgctgaccgcggattttgcggaagataaagatgtgtgcaaa
aactatcaggaagcgaaagatgcgtttctgggcagctttctgtatgaatatagccgccgc
catccggaatatgcggtgagcgtgctgctgcgcctggcgaaagaatatgaagcgaccctg
gaagaatgctgcgcgaaagatgatccgcatgcgtgctatagcaccgtgtttgataaactg
aaacatctggtggatgaaccgcagaacctgattaaacagaactgcgatcagtttgaaaaa
ctgggcgaatatggctttcagaacgcgctgattgtgcgctatacccgcaaagtgccgcag
gtgagcaccccgaccctggtggaagtgagccgcagcctgggcaaagtgggcacccgctgc
tgcaccaaaccggaaagcgaacgcatgccgtgcaccgaagattatctgagcctgattctg
aaccgcctgtgcgtgctgcatgaaaaaaccccggtgagcgaaaaagtgaccaaatgctgc
accgaaagcctggtgaaccgccgcccgtgctttagcgcgctgaccccggatgaaacctat
gtgccgaaagcgtttgatgaaaaactgtttacctttcatgcggatatttgcaccctgccg
gataccgaaaaacagattaaaaaacagaccgcgctggtggaactgctgaaacataaaccg
aaagcgaccgaagaacagctgaaaaccgtgatggaaaactttgtggcgtttgtggataaa
tgctgcgcggcggatgataaagaagcgtgctttgcggtggaaggcccgaaactggtggtg
agcacccagaccgcgctggcg


bsa frame 2

&NG&PLLACCCCLAARIAAACFAAIPIKAKLRIALKIWAKNILKAWC&LRLASICSSARLMNM&NW&TN&PNLRK
PAWRMKAMRAAKKACIPCLAMNCAKWRACAKPMAIWRIAAKNRNRNATNAF&AIKMIARICRN&NRIRTPCAMNL
KRMKKNFGANICMKLRAAIRIFMRRNCCIMRTNITACFRNAARRKIKARACCRKLKPCAKKC&PAARASACAARA
FRNLANAR&KRGAWRA&ARNFRKRNLWK&PNW&PI&PKCIKNAAMAICWNARMIARIWRNIFAITRIPLAAN&KN
AAINRCWKKAIALRKWKKMRFRKTCRR&PRILRKIKMCAKTIRKRKMRFWAAFCMNIAAAIRNMR&ACCCAWRKN
MKRPWKNAARKMIRMRAIAPCLIN&NIWWMNRRT&LNRTAISLKNWANMAFRTR&LCAIPAKCRR&APRPWWK&A
AAWAKWAPAAAPNRKANACRAPKII&A&F&TACACCMKKPR&AKK&PNAAPKAW&TAARALAR&PRMKPMCRKRL
MKNCLPFMRIFAPCRIPKNRLKNRPRWWNC&NINRKRPKNS&KP&WKTLWRLWINAARRMIKKRALRWKARNWW&
APRPRWR


bsa frame 2 with stop codons replaced with alanine

ANGAPLLACCCCLAARIAAACFAAIPIKAKLRIALKIWAKNILKAWCALRLASICSSARLMNMANWATNAPNLRK
PAWRMKAMRAAKKACIPCLAMNCAKWRACAKPMAIWRIAAKNRNRNATNAFAAIKMIARICRNANRIRTPCAMNL
KRMKKNFGANICMKLRAAIRIFMRRNCCIMRTNITACFRNAARRKIKARACCRKLKPCAKKCAPAARASACAARA
FRNLANARAKRGAWRAAARNFRKRNLWKAPNWAPIAPKCIKNAAMAICWNARMIARIWRNIFAITRIPLAANAKN
AAINRCWKKAIALRKWKKMRFRKTCRRAPRILRKIKMCAKTIRKRKMRFWAAFCMNIAAAIRNMRAACCCAWRKN
MKRPWKNAARKMIRMRAIAPCLINANIWWMNRRTALNRTAISLKNWANMAFRTRALCAIPAKCRRAAPRPWWKAA
AAWAKWAPAAAPNRKANACRAPKIIAAAFATACACCMKKPRAAKKAPNAAPKAWATAARALARAPRMKPMCRKRL
MKNCLPFMRIFAPCRIPKNRLKNRPRWWNCANINRKRPKNSAKPAWKTLWRLWINAARRMIKKRALRWKARNWWA
APRPRWR


bsa frame 3

EMGDLY@PAAAV@QRV@PRRVSPRYP#KRNCASL#RSGRRTF#RPGADCV@PVSAAVPV&&TCETGERTDRICEN
LRGG&KPCGLRKKPAYPVWR&TVQSGEPARNLWRYGGLLRKTGTGTQRMLSEP#R&@PGSAETETGSEHPVR&I#
SG&KKILGQISV&NCAPPSVFLCAGTAVLCEQI#RRVSGMLPGGR#RRVPAAEN&NHARKSADQQRAPAPALREH
SEIWRTRAESVERGAPEPEISESGICGSDQTGDRSDQSA#RMLPWRSAGMRG&SRGSGEIYLR#PGYH@QQTERM
LR#TAAGKKPLHCGSGKRCDSGKPAAADRGFCGR#RCVQKLSGSERCVSGQLSV&I@PPPSGICGERAAAPGERI
&SDPGRMLRER&SACVL@HRV&#TETSGG&TAEPD#TELRSV&KTGRIWLSERADCALYPQSAAGEHPDPGGSEP
QPGQSGHPLLHQTGKRTHAVHRRLSEPDSEPPVRAA&KNPGERKSDQMLHRKPGEPPPVL@RADPG&NLCAESV&
&KTVYLSCGYLHPAGYRKTD#KTDRAGGTAET#TESDRRTAENRDGKLCGVCG#MLRGG&#RSVLCGGRPETGGE
HPDRAG


bsa frame 3 with stop codons replaced with alanine

EMGDLYAPAAAVAQRVAPRRVSPRYPAKRNCASLARSGRRTFARPGADCVAPVSAAVPVAATCETGERTDRICEN
LRGGAKPCGLRKKPAYPVWRATVQSGEPARNLWRYGGLLRKTGTGTQRMLSEPARAAPGSAETETGSEHPVRAIA
SGAKKILGQISVANCAPPSVFLCAGTAVLCEQIARRVSGMLPGGRARRVPAAENANHARKSADQQRAPAPALREH
SEIWRTRAESVERGAPEPEISESGICGSDQTGDRSDQSAARMLPWRSAGMRGASRGSGEIYLRAPGYHAQQTERM
LRATAAGKKPLHCGSGKRCDSGKPAAADRGFCGRARCVQKLSGSERCVSGQLSVAIAPPPSGICGERAAAPGERI
ASDPGRMLRERASACVLAHRVAATETSGGATAEPDATELRSVAKTGRIWLSERADCALYPQSAAGEHPDPGGSEP
QPGQSGHPLLHQTGKRTHAVHRRLSEPDSEPPVRAAAKNPGERKSDQMLHRKPGEPPPVLARADPGANLCAESVA
AKTVYLSCGYLHPAGYRKTDAKTDRAGGTAETATESDRRTAENRDGKLCGVCGAMLRGGAARSVLCGGRPETGGE
HPDRAG