some sequence information for GST, KLH, and BSA 4-1-13
2014-08-29 azim58
potential KLH transcript
ATTCTGGTGCGCAAAAACATTCATAGCCTGAGCCATCATGAAGCGGAAGAACTGCGCGATGCGCTGTATAAACTG
CAGAACGATGAAAGCCATGGCGGCTATGAACATATTGCGGGCTTTCATGGCTATCCGAACCTGTGCCCGGAAAAA
GGCGATGAAAAATATCCGTGCTGCGTGCATGGCATGAGCATTTTTCCGCATTGGCATCGCCTGCATACCATTCAG
TTTGAACGCGCGCTGAAAAAACATGGCAGCCATCTGGGCATTCCGTATTGGGATTGGACCCAGACCATTAGCAGC
CTGCCGACCTTTTTTGCGGATAGCGGCAACAACAACCCGTTTTTTAAATATCATATTCGCAGCATTAACCAGGAT
ACCGTGCGCGATGTGAACGAAGCGATTTTTCAGCAGACCAAATTTGGCGAATTTAGCAGCATTTTTTATCTGGCG
CTGCAGGCGCTGGAAGAAGATAACTATTGCGATTTTGAAGTGCAGTATGAAATTCTGCATAACGAAGTGCATGCG
CTGATTGGCGGCGCGGAAAAATATAGCATGAGCACCCTGGAATATAGCGCGTTTGATCCGTATTTTATGATTCAT
CATGCGAGCCTGGATAAAATTTGGATTATTTGGCAGGAACTGCAGAAACGCCGCGTGAAACCGGCGCATGCGGGC
AGCTGCGCGGGCGATATTATGCATGTGCCGCTGCATCCGTTTAACTATGAAAGCGTGAACAACGATGATTTTACC
CGCGAAAACAGCCTGCCGAACGCGGTGGTGGATAGCCATCGCTTTAACTATAAATATGATAACCTGAACCTGCAT
GGCCATAACATTGAAGAACTGGAAGAAGTGCTGCGCAGCCTGCGCCTGAAAAGCCGCGTGTTTGCGGGCTTTGTG
CTGAGCGGCATTCGCACCACCGCGGTGGTGAAAGTGTATATTAAAAGCGGCACCGATAGCGATGATGAATATGCG
GGCAGCTTTGTGATTCTGGGCGGCGCGAAAGAAATGCCGTGGGCGTATGAACGCCTGTATCGCTTTGATATTACC
GAAACCGTGCATAACCTGAACCTGACCGATGATCATGTGAAATTTCGCTTTGATCTGAAAAAATATGATCATACC
GAACTGGATGCGAGCGTGCTGCCGGCGCCGATTATTGTGCGCCGCCCGAACAACGCGGTGTTTGATATTATTGAA
ATTCCGATTGGCAAAGATGTGAACCTGCCGCCGAAAGTGGTGGTGAAACGCGGCACCAAAATTATGTTTATGAGC
GTGGATGAAGCGGTGACCACCCCGATGCTGAACCTGGGCAGCTATACCGCGATGTTTAAATGCAAAGTGCCGCCG
TTTAGCTTTCATGCGTTTGAACTGGGCAAAATGTATAGCGTGGAAAGCGGCGATTATTTTATGACCGCGAGCACC
ACCGAACTGTGCAACGATAACAACCTGCGCATTCATGTGCATGTGGAT
frame 2 KLH translation with stop codons replaced with alanine
FWCAKTFIAAAIMKRKNCAMRCINCRTMKAMAAMNILRAFMAIRTCARKKAMKNIRAACMAAAFFRIGIACIPFS
LNARAKNMAAIWAFRIGIGPRPLAACRPFLRIAATTTRFLNIIFAALTRIPCAMATKRFFSRPNLANLAAFFIWR
CRRWKKITIAILKCSMKFCITKCMRALAARKNIAAAPWNIARLIRILAFIMRAWIKFGLFGRNCRNAAANRRMRA
AARAILCMCRCIRLTMKAATTMILPAKTACRTRWWIAIALTINMITATCMAITLKNWKKCCAACAAKAACLRALC
AAAFAPPRWAKCILKAAPIAMMNMRAALAFWAARKKCRGRMNACIALILPKPCITATAPMIMANFALIAKNMIIP
NWMRACCRRRLLCAARTTRCLILLKFRLAKMATCRRKWWANAAPKLCLAAWMKRAPPRCATWAAIPRCLNAKCRR
LAFMRLNWAKCIAWKAAIILAPRAPPNCATITTCAFMCMWI
frame 3 KLH translation
SGAQKHS@PEPS&SGRTARCAV#TAER&KPWRL&TYCGLSWLSEPVPGKRR&KISVLRAWHEHFSALASPAYHSV
&TRAEKTWQPSGHSVLGLDPDH@QPADLFCG@RQQQPVF#ISYSQH#PGYRARCERSDFSADQIWRI@QHFLSGA
AGAGRR#LLRF&SAV&NSA#RSACADWRRGKI@HEHPGI@RV&SVFYDSSCEPG#NLDYLAGTAETPRETGACGQ
LRGRYYACAAASV#L&KREQR&FYPRKQPAERGGG@PSL#L#I&#PEPAWP#H&RTGRSAAQPAPEKPRVCGLCA
ERHSHHRGGESVY#KRHR@R&&ICGQLCDSGRRERNAVGV&TPVSL&YYRNRA#PEPDR&SCEISL&SEKI&SYR
TGCERAAGADYCAPPEQRGV&YY&NSDWQRCEPAAESGGETRHQNYVYERG&SGDHPDAEPGQLYRDV#MQSAAV
@LSCV&TGQNV@RGKRRLFYDREHHRTVQR#QPAHSCACG
frame 3 KLH translation with stop codons replaced with alanine
SGAQKHSAPEPSASGRTARCAVATAERAKPWRLATYCGLSWLSEPVPGKRRAKISVLRAWHEHFSALASPAYHSV
ATRAEKTWQPSGHSVLGLDPDHAQPADLFCGARQQQPVFAISYSQHAPGYRARCERSDFSADQIWRIAQHFLSGA
AGAGRRALLRFASAVANSAARSACADWRRGKIAHEHPGIARVASVFYDSSCEPGANLDYLAGTAETPRETGACGQ
LRGRYYACAAASVALAKREQRAFYPRKQPAERGGGAPSLALAIAAPEPAWPAHARTGRSAAQPAPEKPRVCGLCA
ERHSHHRGGESVYAKRHRARAAICGQLCDSGRRERNAVGVATPVSLAYYRNRAAPEPDRASCEISLASEKIASYR
TGCERAAGADYCAPPEQRGVAYYANSDWQRCEPAAESGGETRHQNYVYERGASGDHPDAEPGQLYRDVAMQSAAV
ALSCVATGQNVARGKRRLFYDREHHRTVQRAQPAHSCACG
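The frame 2 and frame 3 translations above can be reproduced with a short script. This is only a minimal sketch, not necessarily the tool that generated these notes: it assumes the standard genetic code, and it assumes the stop symbols mean TAG = @, TGA = & and TAA = #, which is what the first codons of the frame 3 KLH translation suggest. The function name translate_frame is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed stop symbols, inferred from the frame 3 KLH translation in these notes.
CODON_TABLE.update({"TAA": "#", "TAG": "@", "TGA": "&"})


def translate_frame(dna, frame):
    """Translate dna in forward frame 1, 2 or 3; unknown codons become X."""
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    # Join wrapped lines and accept lowercase input (as in the BSA transcript).
    seq = "".join(dna.split()).upper()
    start = frame - 1
    # Incomplete trailing codons are dropped.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(start, len(seq) - 2, 3)
    )


# First line of the potential KLH transcript.
klh_first_line = (
    "ATTCTGGTGCGCAAAAACATTCATAGCCTGAGCCATCATGAAGCGGAAGAACTGCGCGATGCGCTGTATAAACTG"
)
print(translate_frame(klh_first_line, 2))  # FWCAKTFIA&AIMKRKNCAMRCIN
print(translate_frame(klh_first_line, 3))  # SGAQKHS@PEPS&SGRTARCAV#T
```

Passing the whole transcript (all lines pasted together) instead of only the first line gives the full frame translation, which can then be wrapped to the 75-column width used in these notes.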
potential GST transcript
ATGAAACTGTATTATACCCCGGGCAGCTGCAGCCTGAGCCCGCATATTGTGCTGCGCGAAACCGGCCTGGATTTT
AGCATTGAACGCATTGATCTGCGCACCAAAAAAACCGAAAGCGGCAAAGATTTTCTGGCGATTAACCCGAAAGGC
CAGGTGCCGGTGCTGCAGCTGGATAACGGCGATATTCTGACCGAAGGCGTGGCGATTGTGCAGTATCTGGCGGAT
CTGAAACCGGATCGCAACCTGATTGCGCCGCCGAAAGCGCTGGAACGCTATCATCAGATTGAATGGCTGAACTTT
CTGGCGAGCGAAGTGCATAAAGGCTATAGCCCGCTGTTTAGCAGCGATACCCCGGAAAGCTATCTGCCGGTGGTG
AAAAACAAACTGAAAAGCAAATTTGTGTATATTAACGATGTGCTGAGCAAACAGAAATGCGTGTGCGGCGATCAT
TTTACCGTGGCGGATGCGTATCTGTTTACCCTGAGCCAGTGGGCGCCGCATGTGGCGCTGGATCTGACCGATCTG
AGCCATCTGCAGGATTATCTGGCGCGCATTGCGCAGCGCCCGAACGTGCATAGCGCGCTGGTGACCGAAGGCCTG
ATTAAAGAA
GST frame 2
&NCIIPRAAAA&ARILCCAKPAWILALNALICAPKKPKAAKIFWRLTRKARCRCCSWITAIF&PKAWRLCSIWRI
&NRIAT&LRRRKRWNAIIRLNG&TFWRAKCIKAIARCLAAIPRKAICRW&KTN&KANLCILTMC&ANRNACAAII
LPWRMRICLP&ASGRRMWRWI&PI&AICRIIWRALRSARTCIARW&PKA&LKK
GST frame 2 with stop codons replaced with alanine
ANCIIPRAAAAAARILCCAKPAWILALNALICAPKKPKAAKIFWRLTRKARCRCCSWITAIFAPKAWRLCSIWRI
ANRIATALRRRKRWNAIIRLNGATFWRAKCIKAIARCLAAIPRKAICRWAKTNAKANLCILTMCAANRNACAAII
LPWRMRICLPAASGRRMWRWIAPIAAICRIIWRALRSARTCIARWAPKAALKK
GST frame 3
ETVLYPGQLQPEPAYCAARNRPGF@H&TH&SAHQKNRKRQRFSGD#PERPGAGAAAG#RRYSDRRRGDCAVSGGS
ETGSQPDCAAESAGTLSSD&MAELSGERSA#RL@PAV@QRYPGKLSAGGEKQTEKQICVY#RCAEQTEMRVRRSF
YRGGCVSVYPEPVGAACGAGSDRSEPSAGLSGAHCAAPERA@RAGDRRPD#R
GST frame 3 with stop codons replaced with alanine
ETVLYPGQLQPEPAYCAARNRPGFAHATHASAHQKNRKRQRFSGDAPERPGAGAAAGARRYSDRRRGDCAVSGGS
ETGSQPDCAAESAGTLSSDAMAELSGERSAARLAPAVAQRYPGKLSAGGEKQTEKQICVYARCAEQTEMRVRRSF
YRGGCVSVYPEPVGAACGAGSDRSEPSAGLSGAHCAAPERAARAGDRRPDAR
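The "stop codons replaced with alanine" versions can be derived from the raw frame translations by swapping each stop symbol for A. A minimal sketch, assuming the three stop symbols used in these notes are &, @ and #; the function names are hypothetical.

```python
# Stop symbols as they appear in the raw frame translations in these notes.
STOP_SYMBOLS = "&@#"
_TO_ALANINE = str.maketrans(STOP_SYMBOLS, "A" * len(STOP_SYMBOLS))


def replace_stops_with_alanine(protein):
    """Return the translation with every stop symbol replaced by A."""
    return protein.translate(_TO_ALANINE)


def count_stops(protein):
    """Count how many stop symbols a raw translation contains."""
    return sum(protein.count(symbol) for symbol in STOP_SYMBOLS)


# Start of the GST frame 2 translation.
gst_frame2_start = "&NCIIPRAAAA&ARILCCAKPAWI"
print(replace_stops_with_alanine(gst_frame2_start))  # ANCIIPRAAAAAARILCCAKPAWI
print(count_stops(gst_frame2_start))                 # 2
```

Counting the stops before replacing them is a quick way to confirm that a block labeled as a raw translation really still contains its stop symbols.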
possible BSA transcript
atgaaatgggtgacctttattagcctgctgctgctgtttagcagcgcgtatagccgcggc
gtgtttcgccgcgatacccataaaagcgaaattgcgcatcgctttaaagatctgggcgaa
gaacattttaaaggcctggtgctgattgcgtttagccagtatctgcagcagtgcccgttt
gatgaacatgtgaaactggtgaacgaactgaccgaatttgcgaaaacctgcgtggcggat
gaaagccatgcgggctgcgaaaaaagcctgcataccctgtttggcgatgaactgtgcaaa
gtggcgagcctgcgcgaaacctatggcgatatggcggattgctgcgaaaaacaggaaccg
gaacgcaacgaatgctttctgagccataaagatgatagcccggatctgccgaaactgaaa
ccggatccgaacaccctgtgcgatgaatttaaagcggatgaaaaaaaattttggggcaaa
tatctgtatgaaattgcgcgccgccatccgtatttttatgcgccggaactgctgtattat
gcgaacaaatataacggcgtgtttcaggaatgctgccaggcggaagataaaggcgcgtgc
ctgctgccgaaaattgaaaccatgcgcgaaaaagtgctgaccagcagcgcgcgccagcgc
ctgcgctgcgcgagcattcagaaatttggcgaacgcgcgctgaaagcgtggagcgtggcg
cgcctgagccagaaatttccgaaagcggaatttgtggaagtgaccaaactggtgaccgat
ctgaccaaagtgcataaagaatgctgccatggcgatctgctggaatgcgcggatgatcgc
gcggatctggcgaaatatatttgcgataaccaggataccattagcagcaaactgaaagaa
tgctgcgataaaccgctgctggaaaaaagccattgcattgcggaagtggaaaaagatgcg
attccggaaaacctgccgccgctgaccgcggattttgcggaagataaagatgtgtgcaaa
aactatcaggaagcgaaagatgcgtttctgggcagctttctgtatgaatatagccgccgc
catccggaatatgcggtgagcgtgctgctgcgcctggcgaaagaatatgaagcgaccctg
gaagaatgctgcgcgaaagatgatccgcatgcgtgctatagcaccgtgtttgataaactg
aaacatctggtggatgaaccgcagaacctgattaaacagaactgcgatcagtttgaaaaa
ctgggcgaatatggctttcagaacgcgctgattgtgcgctatacccgcaaagtgccgcag
gtgagcaccccgaccctggtggaagtgagccgcagcctgggcaaagtgggcacccgctgc
tgcaccaaaccggaaagcgaacgcatgccgtgcaccgaagattatctgagcctgattctg
aaccgcctgtgcgtgctgcatgaaaaaaccccggtgagcgaaaaagtgaccaaatgctgc
accgaaagcctggtgaaccgccgcccgtgctttagcgcgctgaccccggatgaaacctat
gtgccgaaagcgtttgatgaaaaactgtttacctttcatgcggatatttgcaccctgccg
gataccgaaaaacagattaaaaaacagaccgcgctggtggaactgctgaaacataaaccg
aaagcgaccgaagaacagctgaaaaccgtgatggaaaactttgtggcgtttgtggataaa
tgctgcgcggcggatgataaagaagcgtgctttgcggtggaaggcccgaaactggtggtg
agcacccagaccgcgctggcg
BSA frame 2
&NG&PLLACCCCLAARIAAACFAAIPIKAKLRIALKIWAKNILKAWC&LRLASICSSARLMNM&NW&TN&PNLRK
PAWRMKAMRAAKKACIPCLAMNCAKWRACAKPMAIWRIAAKNRNRNATNAF&AIKMIARICRN&NRIRTPCAMNL
KRMKKNFGANICMKLRAAIRIFMRRNCCIMRTNITACFRNAARRKIKARACCRKLKPCAKKC&PAARASACAARA
FRNLANAR&KRGAWRA&ARNFRKRNLWK&PNW&PI&PKCIKNAAMAICWNARMIARIWRNIFAITRIPLAAN&KN
AAINRCWKKAIALRKWKKMRFRKTCRR&PRILRKIKMCAKTIRKRKMRFWAAFCMNIAAAIRNMR&ACCCAWRKN
MKRPWKNAARKMIRMRAIAPCLIN&NIWWMNRRT&LNRTAISLKNWANMAFRTR&LCAIPAKCRR&APRPWWK&A
AAWAKWAPAAAPNRKANACRAPKII&A&F&TACACCMKKPR&AKK&PNAAPKAW&TAARALAR&PRMKPMCRKRL
MKNCLPFMRIFAPCRIPKNRLKNRPRWWNC&NINRKRPKNS&KP&WKTLWRLWINAARRMIKKRALRWKARNWW&
APRPRWR
BSA frame 2 with stop codons replaced with alanine
ANGAPLLACCCCLAARIAAACFAAIPIKAKLRIALKIWAKNILKAWCALRLASICSSARLMNMANWATNAPNLRK
PAWRMKAMRAAKKACIPCLAMNCAKWRACAKPMAIWRIAAKNRNRNATNAFAAIKMIARICRNANRIRTPCAMNL
KRMKKNFGANICMKLRAAIRIFMRRNCCIMRTNITACFRNAARRKIKARACCRKLKPCAKKCAPAARASACAARA
FRNLANARAKRGAWRAAARNFRKRNLWKAPNWAPIAPKCIKNAAMAICWNARMIARIWRNIFAITRIPLAANAKN
AAINRCWKKAIALRKWKKMRFRKTCRRAPRILRKIKMCAKTIRKRKMRFWAAFCMNIAAAIRNMRAACCCAWRKN
MKRPWKNAARKMIRMRAIAPCLINANIWWMNRRTALNRTAISLKNWANMAFRTRALCAIPAKCRRAAPRPWWKAA
AAWAKWAPAAAPNRKANACRAPKIIAAAFATACACCMKKPRAAKKAPNAAPKAWATAARALARAPRMKPMCRKRL
MKNCLPFMRIFAPCRIPKNRLKNRPRWWNCANINRKRPKNSAKPAWKTLWRLWINAARRMIKKRALRWKARNWWA
APRPRWR
BSA frame 3
EMGDLY@PAAAV@QRV@PRRVSPRYP#KRNCASL#RSGRRTF#RPGADCV@PVSAAVPV&&TCETGERTDRICEN
LRGG&KPCGLRKKPAYPVWR&TVQSGEPARNLWRYGGLLRKTGTGTQRMLSEP#R&@PGSAETETGSEHPVR&I#
SG&KKILGQISV&NCAPPSVFLCAGTAVLCEQI#RRVSGMLPGGR#RRVPAAEN&NHARKSADQQRAPAPALREH
SEIWRTRAESVERGAPEPEISESGICGSDQTGDRSDQSA#RMLPWRSAGMRG&SRGSGEIYLR#PGYH@QQTERM
LR#TAAGKKPLHCGSGKRCDSGKPAAADRGFCGR#RCVQKLSGSERCVSGQLSV&I@PPPSGICGERAAAPGERI
&SDPGRMLRER&SACVL@HRV&#TETSGG&TAEPD#TELRSV&KTGRIWLSERADCALYPQSAAGEHPDPGGSEP
QPGQSGHPLLHQTGKRTHAVHRRLSEPDSEPPVRAA&KNPGERKSDQMLHRKPGEPPPVL@RADPG&NLCAESV&
&KTVYLSCGYLHPAGYRKTD#KTDRAGGTAET#TESDRRTAENRDGKLCGVCG#MLRGG&#RSVLCGGRPETGGE
HPDRAG
BSA frame 3 with stop codons replaced with alanine
EMGDLYAPAAAVAQRVAPRRVSPRYPAKRNCASLARSGRRTFARPGADCVAPVSAAVPVAATCETGERTDRICEN
LRGGAKPCGLRKKPAYPVWRATVQSGEPARNLWRYGGLLRKTGTGTQRMLSEPARAAPGSAETETGSEHPVRAIA
SGAKKILGQISVANCAPPSVFLCAGTAVLCEQIARRVSGMLPGGRARRVPAAENANHARKSADQQRAPAPALREH
SEIWRTRAESVERGAPEPEISESGICGSDQTGDRSDQSAARMLPWRSAGMRGASRGSGEIYLRAPGYHAQQTERM
LRATAAGKKPLHCGSGKRCDSGKPAAADRGFCGRARCVQKLSGSERCVSGQLSVAIAPPPSGICGERAAAPGERI
ASDPGRMLRERASACVLAHRVAATETSGGATAEPDATELRSVAKTGRIWLSERADCALYPQSAAGEHPDPGGSEP
QPGQSGHPLLHQTGKRTHAVHRRLSEPDSEPPVRAAAKNPGERKSDQMLHRKPGEPPPVLARADPGANLCAESVA
AKTVYLSCGYLHPAGYRKTDAKTDRAGGTAETATESDRRTAENRDGKLCGVCGAMLRGGAARSVLCGGRPETGGE
HPDRAG