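Patent title: Method for producing expressed sequence tags expressed in human liver
Technical field:
The present invention relates to the field of biotechnology, and in particular to a class of expressed sequence tags expressed in human liver.
Background art: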
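The liver is the largest digestive gland in the human body and the central hub of the body's metabolism. An estimated 500 or more chemical reactions take place in the liver. Experiments have shown that an animal whose liver has been completely removed survives for little more than 50 hours at most, even with appropriate treatment, which shows that the liver is an organ indispensable to life. The liver has an extremely rich blood supply, receiving about one quarter of cardiac output, or 1000-1200 ml of blood per minute. Its main functions are: breaking down sugars and storing glycogen; participating in the metabolism of proteins, fats, vitamins and hormones; detoxification; secreting bile; phagocytosis and defense; producing clotting factors; regulating blood volume and water-electrolyte balance; and generating heat. During the embryonic period the liver also has a hematopoietic function.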
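Liver diseases include hepatitis, cirrhosis, fatty liver and liver cancer. Modern medical experiments have shown that after a hepatitis virus enters the body it does not directly damage hepatocytes; it merely draws nutrients from them in order to survive, and replicates and multiplies inside them. Viral components produced during replication, such as the surface antigen (HBsAg) and the e antigen (HBeAg), are released onto the hepatocyte membrane and trigger an immune response against these antigens, and it is this response that injures and kills hepatocytes. The strength of the immune response determines the extent of liver damage and the severity of clinical symptoms. This virus-triggered assault by the immune system on hepatocytes turns the liver of about 25% of patients into a site of persistent conflict, which aggravates the liver damage. The harm caused by liver disease is by no means confined to the liver itself; it can also give rise to many other diseases. Common examples are: (1) diabetes; (2) pancreatitis; (3) biliary tract infection; (4) functional renal failure; (5) cholemic nephropathy; (6) glomerulonephritis; (7) renal tubular acidosis; (8) hemolytic anemia; (9) aplastic anemia; (10) myocarditis and pericarditis; (11) polyarteritis nodosa; (12) peptic ulcer; (13) spontaneous peritonitis; (14) disordered sex hormone metabolism; (15) altered thyroid function; (16) hepatic osteodystrophy, among others. Liver disease not only endangers the patient's health and even life, but also places a heavy psychological burden on the patient. Both patients and virus carriers can be seriously affected in daily life, social contact, job seeking and education.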
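Transcribed, expressed sequences (that is, genes) account for only 3-5% of the total sequence of a genome. Sequencing this portion leads directly to the discovery of new genes and captures the genomic information most closely tied to industrial application. The emergence of high-throughput automated sequencing in the 1980s made it possible to pick many cDNA clones at random from a plasmid complementary DNA (cDNA) library and to determine several hundred bases of non-vector DNA sequence from either end of each insert. These short DNA sequences are called expressed sequence tags (ESTs). The EST concept was first proposed by Adams et al. in 1992 (Nature, 355, 642-644). In 1992, Sikela and Matsubara (Sikela, et al. Nucleic Acids Res. 19, 1837-1843; Matsubara, et al. Nature Genetics, 2, 173-179), responding to the urgent need for large numbers of messenger RNA (mRNA) sequences, proposed a research strategy of large-scale cDNA sequencing. Venter subsequently established large-scale EST technology. Its essential feature is that many cDNA clones are selected at random from a cDNA library of the target tissue constructed in a plasmid vector, and one round of DNA sequencing is performed on both ends of each cDNA using the universal primers carried on the plasmid, yielding short non-vector DNA sequences of several hundred bases from the 3' or 5' end. In short, an EST is a short DNA sequence from the 3' or 5' end of an expressed gene fragment and represents part of the transcript of an expressed gene.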
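ESTs can be used for cloning new genes, mapping the human genome, identifying coding regions in genomic sequence, and so on. An EST that occurs only once in the genome can serve as a sequence-tagged site (STS). A physical map built from ESTs is called an expression map or transcript map. Using ESTs for gene mapping speeds up both the generation of STSs and the chromosomal localization of new genes. ESTs can serve as gene-specific probes and are important for studying tissue-specific gene expression. ESTs can also be used to analyze the evolutionary relationships of new genes: ESTs from all animals and plants can be pooled as a database, and comparing sequences across it reveals conserved segments, from which an evolutionary map of the genes can be derived. Because of these advantages, EST sequencing has become a focus of many genome research institutions.
Because the genes specifically expressed in human liver that are addressed by the present invention are associated with certain liver diseases, studying ESTs specifically expressed in human liver is important for exploring the pathogenesis of liver diseases and for developing drugs to treat them.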
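Summary of the invention
The technical problem to be solved by the present invention is to provide a class of expressed sequence tags expressed in human liver.
This technical problem is solved by the following technical solution: the present invention provides sequences of a class of expressed sequence tags expressed in human liver, comprising (a) the sequences shown in SEQ ID No.1 to SEQ ID No.50; (b) the complementary sequence of each of the sequences shown in SEQ ID No.1 to SEQ ID No.50; (c) sequences having at least 70% homology with each of the sequences shown in SEQ ID No.1 to SEQ ID No.50; and (d) a combination of one or more of (a) to (c) above.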
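Items (b) and (c) can be illustrated with a minimal sketch. It assumes that the complementary sequence is written as the reverse complement (the usual 5' to 3' convention) and that "homology" is approximated by ungapped percent identity between two pre-aligned sequences of equal length; the patent fixes neither convention, and the function names are hypothetical.

```python
# Illustrative helpers only; names and conventions are assumptions, not the patent's method.
_COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string; ambiguous 'n' stays 'n'."""
    return seq.translate(_COMPLEMENT)[::-1]


def percent_identity(a, b):
    """Percent of matching positions between two equal-length, pre-aligned sequences.

    Positions where the bases differ, or where the shared base is 'n',
    count as mismatches.
    """
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a.lower(), b.lower()) if x == y and x != "n")
    return 100.0 * matches / len(a)


def meets_threshold(a, b, threshold=70.0):
    """True if the two aligned sequences share at least `threshold` percent identity."""
    return percent_identity(a, b) >= threshold
```

A real comparison would first align the sequences with gaps (for example with BLAST or FASTA, as in Example 3); the fixed-length comparison here only shows how a 70% cutoff is applied once an alignment exists.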
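Preferably, the sequences include those shown in SEQ ID No.1 to SEQ ID No.50.
The present invention also provides a probe molecule containing about 8-100 consecutive nucleotides of the above sequences.
With the expressed sequence tags expressed in human liver provided by the present invention, ESTs specifically expressed in human liver can be readily identified, and genes associated with human liver diseases can in turn be found, which plays an important role in studying the pathogenesis of liver diseases and in developing drugs to treat them.
Detailed description of embodiments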
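As a rough sketch of what "about 8-100 consecutive nucleotides" means in practice, the generator below enumerates every window of a chosen length from one EST. The hard 8 and 100 limits, the option to skip windows containing the ambiguous base `n`, and the function name are illustrative assumptions; the text itself says "about" and gives no selection rule.

```python
def probe_candidates(seq, length, skip_ambiguous=True):
    """Yield (start, window) for each run of `length` consecutive nucleotides.

    `start` is a 0-based offset into `seq`. Windows containing 'n' are
    skipped by default because an uncalled base makes a poor probe.
    """
    if not 8 <= length <= 100:
        raise ValueError("probe length is expected to be within 8-100 nucleotides")
    seq = seq.lower()
    for start in range(len(seq) - length + 1):
        window = seq[start:start + length]
        if skip_ambiguous and "n" in window:
            continue
        yield start, window
```

For example, `list(probe_candidates("acgtacgtn", 8))` keeps only the first window, because the second one ends in `n`.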
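The present invention is described in further detail below with reference to the accompanying drawings. It should be understood that these examples serve only to illustrate the invention and not to limit its scope. Experimental methods for which no specific conditions are given in the following examples generally follow conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or the conditions recommended by the manufacturer.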
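Example 1: Isolation of mRNA from human liver tissue
Tissue isolation: Liver was obtained from five adult males; immediately after hepatectomy, the liver tissue was placed in liquid nitrogen for frozen storage.
mRNA isolation: The liver tissue was taken out and ground in a mortar, added to a 50 ml tube containing lysis buffer and shaken thoroughly, then transferred to a glass homogenizer. After homogenization it was transferred to a fresh 50 ml tube and total RNA was extracted (TRIzol Reagents, Gibco, NY, USA). The quality of the total RNA was checked by formaldehyde denaturing gel electrophoresis. mRNA was isolated from the total RNA on an oligo(dT)-cellulose column and quantified.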
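Example 2: Construction of the cDNA library
Double-stranded cDNA was synthesized using mRNA as the template. After the ends were blunted, adaptors containing an EcoRI site were added. The EcoRI ends were phosphorylated, the cDNA was digested with the restriction endonuclease XhoI for 1.5 hours, and the fragments were then size-fractionated. Fragments longer than 500 bp were selected on a column, extracted with phenol-chloroform, precipitated with ethanol, dissolved in sterile water and ligated into the Uni-ZAP XR vector (Stratagene, CA9203, USA). Packaging was performed with the ZAP-cDNA Gigapack III Gold Cloning Kit (Stratagene, CA9203, USA), using XL1-Blue MRF' (Stratagene, CA9203, USA) bacteria as the host strain. The library was plated and its titer determined.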
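Example 3: Sequencing and database construction
Clones in the library carrying an inserted foreign fragment were picked and amplified, and plasmid was extracted (Qiagen, Germany). Using T3 and T7 as the universal primers for the 3' and 5' ends, large-scale EST sequencing was carried out on an ABI 377 sequencer (Perkin-Elmer, USA) by the fluorescent dye-terminator method (Big-Dye, Perkin-Elmer, USA). Vector sequence was removed from the reads with the FACTURA software, and the reads were transferred to a SUN Ultra 450 Server for further processing. All sequences were then searched against existing databases (GenBank + EMBL) with the BLAST and FASTA programs in the GCG software package (Wisconsin group, USA); sequences with no homology, or with homology below 95%, were treated as new genes and used to build the database.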
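The final selection step (keep reads with no database hit, or with a best hit below 95%) can be sketched as a simple filter. The sketch assumes the search results have already been reduced to one best-hit percentage per read, with `None` meaning no hit; it does not run BLAST or FASTA, and the clone names in the usage note are made up.

```python
def select_novel(best_hit_identity, cutoff=95.0):
    """Return the names of reads treated as candidate new genes.

    `best_hit_identity` maps a read name to the percent identity of its best
    database hit, or to None when the search found no hit at all.
    """
    return [
        name
        for name, identity in best_hit_identity.items()
        if identity is None or identity < cutoff
    ]
```

For instance, `select_novel({"cloneA": None, "cloneB": 99.2, "cloneC": 87.5})` returns `["cloneA", "cloneC"]`; a read at exactly 95% is not kept, since the text says "below 95%".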
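Example 4: Full-length cloning of genes (cloning of full-length cDNA)
On the basis of the sequence information of the new gene fragments obtained, full-length cDNA cloning was carried out using the following methods.
(1) "Electronic cloning": The new gene fragment sequence was used as a probe to search the dbEST database. EST sequences overlapping the probe by more than 50 bp with at least 98% homology were regarded as the same sequence (consensus sequence), retrieved and joined with the AUTOASSEMBLER software; some ESTs extend the probe sequence. The STRIDER software was then used to analyze whether the extended sequence contains a complete open reading frame (ORF), and BLAST searches of GenBank or SwissProt were used to determine whether the sequence is homologous to sequences from other species at the nucleotide and amino acid levels, which helps judge how complete the full-length gene is. Electronic cloning can usually yield the full-length sequences of human liver-related genes.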
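(2) Rapid amplification of cDNA ends (RACE): If electronic cloning still did not give the complete full-length cDNA, primers were designed at the 5' or 3' end of the known sequence and long-distance PCR was performed on a human liver Marathon-Ready cDNA library (Clontech Lab, Inc, USA). The PCR products were then cloned and sequenced. The AUTOASSEMBLER and STRIDER software were used to analyze whether the extended sequence contains a complete ORF; if not, the procedure was repeated until the full length was obtained.
(3) RT-PCR: Where the 5' and 3' end sequences are known but a gap between them cannot be filled from public databases or the in-house database, RT-PCR can be used. A primer is designed at the 5' end of the sequence and oligo(dT) is used as the 3' primer, and amplification is performed on liver total RNA. The products are then cloned and sequenced, and the final assembly gives the full length.
By combining the three methods above, the full-length coding sequences of human liver-related proteins can be obtained.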
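Two computational steps of Example 4 can be sketched in a few lines: extending a probe with an overlapping EST (more than 50 bp of overlap at 98% or better) and checking the extended sequence for a complete ORF. This is an illustrative stand-in, not the algorithm used by AUTOASSEMBLER or STRIDER. It assumes ungapped overlaps, extension at the 3' end only, forward-strand ORFs only, and an ORF defined as ATG through the first in-frame stop codon; all function names are hypothetical.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def extend_with_est(contig, est, min_overlap=51, min_identity=98.0):
    """Extend `contig` at its 3' end with `est` if they overlap well enough.

    Tries the longest suffix/prefix overlap first. Returns the extended
    sequence, or None when no overlap of at least `min_overlap` bases
    reaches `min_identity` percent.
    """
    longest = min(len(contig), len(est))
    for overlap in range(longest, min_overlap - 1, -1):
        tail, head = contig[-overlap:], est[:overlap]
        matches = sum(1 for x, y in zip(tail, head) if x == y)
        if 100.0 * matches / overlap >= min_identity:
            return contig + est[overlap:]
    return None


def longest_orf(seq):
    """Return (start, end) of the longest ATG-to-stop ORF on the forward strand.

    `end` is exclusive and includes the stop codon. Returns None when no
    complete ORF exists in any of the three forward reading frames.
    """
    seq = seq.lower()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "atg":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if best is None or (i + 3 - start) > (best[1] - best[0]):
                    best = (start, i + 3)
                start = None
    return best
```

When `longest_orf` returns None for the extended sequence, the workflow above would go on to RACE or RT-PCR; a fuller check would also scan the reverse complement.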
Sequence Listing
<110> Shanghai Human Genome Research Center
<120> Expressed sequence tags expressed in human liver
<130> NP-10039
<160> 50
<210> 1
<211> 387
<212> DNA
<213> Homo sapiens
<400> 1
1 atacagaaga ggagatctcc tcattgtata ttttatatct tatagaactt tcattaatga
61 atcataggaa tataccagct cttcaaaaat taagacatta tacaaagtga aatactaaaa
121 acagaaatat gtgtttttct caaataatat gcttctcagg aaagacttta cagaaacatc
181 tcttcttatc cgaaattaca tccaaaccct ttttttgtgt gaagtgttgc ttccagtcta
241 ctaaggtaat aagcaagtaa caagatagta ttgtttgata ctcttgaacg cactctgatt
301 ttcaattcca attttgtttg aacctgtcag cattttggga tctgggcccc gtcatgtccn
361 accgtcattt tattcttttg ggggacc
<210> 2
<211> 440
<212> DNA
<213> Homo sapiens
<400> 2
1 angacggagt gagggggcag ggtagttatt aattagaaga tacgggcaaa acactgggat
61 ggcttcctga caacttaaga ggtctccgag ttatattctg ggttgggaaa cactgaccca
121 gcccttattc cttcaaggac tctagtcatt ggcaaggagg attcatgagc cccggtgaca
181 cagatggggg ccctgctcta tattcaactg tccagagaag atctagtcac aacccctcat
241 ctcaaatgga ctttggttca gaaaacagaa ctaaggaaaa gtggtttcct tggactcaac
301 cggtctgaat ttcccactaa ccacaacact gacttcctga ggcaagctgg gctaatccca
361 cgtnttacag ggcaattact tgtacatcac agggctgggg gacaacacac accaagggnt
421 gaggaattnt ctaaaaagcc<210>3<211>258<212>DNA<213>Homo sapiens<400>31 attctgtgaa aaaattatat aattnataaa attttaagct gttgcatatn ccttgcttca61 acttagaata cacttacaaa gaagaaagtt aaataacttt gaggtctacc atgaaaattt121 aatgttttta atcattttgt tgtatatttt tatagctttc tcccataatt ccttgggttt181 ccttcacaga aggcttaaat gagtagtcaa gtatatatat gtctatntgg aattnacttn241 cctattaata acatgttt<210>4<211>339<212>DNA<213>Homo sapiens<400>41 ctgatccatg tagaggagga gaacagtgca taaaacatag taagcactca ataactgtta61 cttgatatta ctacttcatc agagtaaata aaaataagac accaggaatg aaagatttaa121 tccaaccttc tttcaaaacc aaaggcaaac tgttcattag ccaaacccca ccattccttg181 tttaaataat tgcgatcacc tcctacctac ctagccttgt agccagcagt ctctcccatt241 catctactgt catcagtcag atttattttc ctaaaaataa tatttaacat ttctagttca301 ggatatgatg aactaacttc ctgctgcaaa aaaaaaaaa<210>5<211>1299<212>DNA<213>Homo sapiens<400>51 caagttacgc ggactccgtg aagggccgat tcaccatctc cagagacaac gccaagaata61 tattgtttct gcaaatgaac gaattgagag ccgacgacac ggctgtgtat tactgtgcaa121 gaggggggga tcatatagta ccggctgctg tcgctccctt ccacatggac gtctggggcc181 aagggaccac ggtcaccgtc tcgtcagcat ccccgaccag ccccaaggtc ttcccgctga241 gcctctgcag cacccagcca gatgggaacg tggtcatcgc ctgcctgcct ggtccagggc301 ttcttcccca ggagccactc agtgtgacct ggagcaaagc ggacagggcg tgaccgccag361 aaacttccca cccagccagg atgcctccgg ggacctgtac accacgagca gccagctgac
421 cctgccggca cacagtgcct agccggcaag tccgtgacat gccacgtgaa gcactacacg481 aatcccagcc aggatgtgac tgtgccctgc ccagttccct caactccacc taccccatct541 ccctcaactc cacctacccc atctccctca tgctgccacc cccgactgtc actgcaccga601 ccggccctcg aggacctgct cttaggttca gaagcgaacc tcaggtgcac actgaccggc661 ctgagagatg cctcaggtgt caccttcacc tggacgccct caagtgggaa gagcgctgtt721 caaggaccac ctgagcgtga cctctgtggc tgctacagcg tgttccagtg tcctgccggg781 ctgtgccgag ccatggaacc atggggagac cttcacttgc actgctgccc accccgagtt841 gaagacccca ctaaccgcca acatcacaaa atccggaaac acattccggc ccgaggtcca901 cctgctgccg ccgccgtcgg aggagctggc cctgaacgag ctggtgacgc tgacgtgcct961 ggcacgtggc ttcagcccaa ggatgtgctg gttcgctggc tgcaggggtc acaggagctg1021 ccccgcgaga agtacctgac ttgggcatcc cggcaggagc ccagccaggg caccaccacc1081 ttcgctgtga ccagcatact gcgcgtggca gccgaggact ggaagaaggg ggacaccttc1141 tcctgcatgg tgggccacga ggccctgccg ctggccttca cacagaagac catcgaccgc1201 ttggcgggta aacccaccca tgtcaatgtg tctgttgtca tggcggaggt ggacggcacc1261 tgctactgag ccgcccgcct gtccccaccc ctgaataaa<210>6<211>204<212>DNA<213>Homo sapiens<400>61 gacagatagg acaaaattta taaacaaaat acaagagaaa aagcacctgn taagatccnt61 taatgacaac ttcaaagcag tgtttcatat tctcttacan aatagtgcca tccatntcca121 agttttgcca agtgtcacgt aaaactgtgt tcaatggtag actataaaag catctacatt181 tctttgccag agggttttgg anta<210>7<211>539<212>DNA<213>Homo sapiens<400>71 cttctanctt tgcaaacttt actgaaattt tctttcattg gatttttttt aaagtaagag61 taaatagaga acttcaggat aatctaatag gcgtattaaa gagaacacct tgttataaaa121 ataaagtatt acatgggttg agtgttttta ctttttaaat ttttcctgtt tcaatcagct181 tcttcagatt aaacagtata tttctgtaca catcaaaagg agacatacat gtatttgttt241 aataaaatta ctataggaca taagtactcc ttacagcaca gtctctgtta acagttgcat301 ggaatttaca catatttgag gcatccngcc attatctttg gggaggatat cccggggctt361 aaccaaattt tatccccagc cnggaaattt accggtggaa atttcctaaa actaggtcct421 cttnggggtt tccataaacc tggtaacctt aatccaattt aaatccanta ctctaccggg481 ttccctggct ccaaccactt gggggcttgg gggngaggct ggggggggtg tgaggcctt
<210>8<211>518<212>DNA<213>Homo sapiens<400>81 atgaaaagat gtttgatgtt tatttccacc ttgcactcag gtctgagcca caagtacatt61 aagacattga atggtatcac ccagggaata cgtaaccaga caacacacaa gactgagatg121 cacaagtggt ggtggtggta attcacgcag aaggaaccag acagtaaaac aaaaattgcc181 caacacacca aatgatcaaa tccgccacct ctaggatagg gcaaacttga ttgctggggt241 taaggaaccc tagaggtctg tttaaggtgg ggcagaggag gggttcttca gctttaggct301 tgtccctgac atcttaactt gcccaggcaa gcacttgtta catattaatt tttccttgag361 ggaacccagg tcctttaggt ggggggaggg ggtattccct ttttgaccca gatctttatn421 gggcttttaa tttggggtca gggtttttgc caaacctttc aaggccttcc cattaaccca481 agctaggggg ggnggggctt cttttattgc tttncagg<210>9<211>506<212>DNA<213>Homo sapiens<400>91 tttttttttt ttgctttccc tcttcttagc cccttcccta aatcctaggg tgntgaaatc61 tcaatatacc ttcttctttt cccagacttg gactaaaatt cccttcttgg tcttccatct121 ctgaagatta agagcctggn actagactgc ccctcagcct gagtcaagga ttctccatat181 ctgtgtagag tcttccaggc agcctgagga ggcatgccat ggntggttcc atgagcaaag241 atagtgagac ccataggant ttggagagac aaggccaccc tgaggaccag antgagagct301 ggcttcctgg agcaggtagg acttggattg agttaagaaa ggaaagtcta gagagtaggg361 aaggaagaca ttccagggca gaagaagtcc tttgagcaag gcactgaact gataatatga421 tatgaaacaa aaagtggtct cagaaagttt naagctaaga agttctaaat ggnttttttc481 agaagcaagt ttaggantta agtagg<210>10<211>495<212>DNA<213>Homo sapiens<400>101 cgtggggttn actgatggtg gctgctgtca nattccaagt ggcttatggg ataggacaac61 cccccaggca cttcactgta ggacagttag caccaagagc taaggttgtg agataatgca
121 aatctggcct gtcacctctg cagagtacag gttcccatac tntgaggcag cagcagagag181 ggaaccacca gagaaacagc ntttcagant tntctttcct ttggtntatg gatatgtgtg241 tgttctagtc tttggtgggc aatggantct gcagctccat gacaatcttg ttaagtagct301 tatgtgggga agtntttcag ggtcacaagg gccacccatt ctaaggcttc tcattttaat361 ttccccaggg nttaaggagg acaggtgggg ggaaagggna aaaaccttng cacctttgct421 attacttnaa ttngggattc caggaggccc aatccaatgg cattntttac cctacttttc481 ttgggcccaa atcca<210>11<211>447<212>DNA<213>Homo sapiens<400>111 ggtcttgctt caggttttct ttagaaatct gaggtttcac aaaacaggtt tctatgactt61 taactgcaca taaaaataaa tgtgttagtt tggtctccaa agctcttgaa tccattagaa121 taaactattc ccccaccatg atggtcctca ccgtggattt catccctttt tnatcctcaa181 ctgtactcac aaatacaatg ttaactttgc ctgctgcatc ctgcctttgc ccttcatctc241 tttgtgctgc tggagctgtc cacagcgatt taggggcaga aagagtacga aacagatcca301 gctttcttga aaatagactc ccttacctgt ctttatgcta gcttaccaaa aagaacggng361 gcttcccttt cccaccctgg gcaggacccg tctgcacaaa ggctctgggg agggcagccg421 gnatatctct aaggccccng ccttcaa<210>12<211>411<212>DNA<213>Homo sapiens<400>121 ttgagacagg gtnttggcct gttgcccagg ttggagaatg cagtggatga cacagcgaga61 ntccatctta aaaaataaaa tagantaaaa taaaaataaa tggccgggca tggtggcttg121 cacctgtaat cccagggcac tttgggaggc caaagcaggc ggatcacttg agaaatggaa181 aaaacaagtc acttacaaat gaggctgaga caaaggntcg gcggaaagag ccacagaaca241 ccaaaaacag aagcagagnt ggacaaatta agcaaaacct tacagttttg tagtttctgc301 ttttatctaa cacaatgact tcgntgtact tcatatacct tttganttat acctgcttat361 acataccttt accctaccta ctacttntac ctacctcgtg ccganttctt g<210>13<211>484<212>DNA<213>Homo sapiens
<400>131 ttgcagtctt ttactacagg aatgtgaaga tatatttaaa aatagaatac tttaaaatca61 ggaaatgtat tttgctctct tgaagaaaat agatatgatt tggcaaagga taaaattagc121 cataaatgat aaagattttt ttaacatctg aacttgtcat tagcactaag atgagctgct181 taccagggna ctaagaagan ttaacatcat caagtttaaa atcgatcatt ttaaatatga241 aggagattta aacatctcaa ccccatggan tcaattgtgc tggataacta tccngtctca301 ttgggatagt ggggctcnat ccaaattgag gggatattca gggaggttta aggaggatcc361 ngcacaggag gaggtgagga ccccaaatgg ctttttttct ntccattttc cattttncct421 gtttattaat ggcctggatg gaggttttaa gggggntttt ataggggggc tttttccttt481 tcca<210>14<211>489<212>DNA<213>Homo sapiens<400>141 acttcactta aatctgctct gagtctctgg atgcctggca ggtggagaat tcaatcttgt61 cgttaccagt attcctttcc cttctccatg ggcttatgta agaattctgg gcttacacac121 tgttggaaag ccaggtagga actacctccc ccgaactctc cattcttcca gctgctcatg181 atccatcaac cttctttggg ccacctgcta tagcaagacc ctcctcacag catcattcca241 ctgacccaca ggctcagccc cagggaccct cactaggaac aggtctccac tatgcatagg301 gaactcacaa aaaccttctc ttncatcttg ggttctggct ggatattcca ggccantccc361 ccanttnttc atntttaaaa cacagntggg caggttcctt tcccntcgtt tccaaaacag421 ggggggtttg ttccagccaa tttttctggc aggacaccaa agttttcacc cgttcctttt481 caggggagg<210>15<211>531<212>DNA<213>Homo sapiens<400>151 ttttttgttt tttgattttg ctttaatgat tttaattagt atccaacatc aattcttaat61 ctgtatacaa cagttctgga gtgacctgga taacaatttg tttgctcata caccttaagt121 actccccaaa tagaatgtat aaaagtacta atttagattg gttttgtgta atctgtgaat181 aaataatatt ttacagcagt tctctgggta atgagagtca ctctgtacag atagtaggta241 tgtacttaaa tatttataca tttccctctg agctagtaag tcattgaccc tgaagctggg301 aatgaggctg ggtttgcacc ttgggaaaac ttaatcgttc ccatgggtaa cacagaggac361 tcttgcccta ccactttctt ccccacatct ttggttatat ttaaaggcaa ggaaattggt421 tatttacttc atatttaata tggttggata cattttcagg ggggcacaat tccctaacct
481 caggcccatt cttgtggtaa taaaatattg ccctagggna acccnttaaa a<210>16<211>482<212>DNA<213>Homo sapiens<400>161 ttttttttac agagtgaata cagctttatt ggatgattta gaaaccacct ggaatcattc61 ttataattca aaacaccaac ctatcattat gataacagta cataagttcc agtattttcc121 actttgagag aaccctggtt tccttggctg ctttctaaag aggagaaaaa gaacacctaa181 cagggagata gatcaacaat gagcacattt nagggcatag gcaaaaagta gggagatcag241 ttgttatgtt atttaggtaa gaactgatac agtaacagtc aggagacaaa tagaaactnc301 agtgggactt nggagacact gcctaagtgg gaagggtgaa agggaagtag gagccatnca361 aaccaagctt aggttngacc atatttgatg gacaaagtcg taaatgggtg tttagggagt421 ttnttaaggn agttaaaagg ggaaattttt aagnggggtn aggctatttt gggttttttt481 tt<210>17<211>455<212>DNA<213>Homo sapiens<400>171 ttttcnttct ttcttttgag gtagagtctt cccttatctg ccaggctgga gtgcaatggt61 gcaatcagct taccacagcc tcaaattcct gggctcgagt gatcctccca cctctgcctc121 ttgagttgct aggactacag gngcacacca attttttttt tttttaagag acagggtctt181 gccatgttgt ccaggctgag ctcaagtgac tcttctgcct cagccccaca aagtgctggg241 attacanggc aagagcaccn gcacctagca tcactgatta ctttctaagg ccaanttccc301 ttttggccta agctgggact gtaaaattcc cttgtggggc anggaaattg tantgtctca361 atnttctttg tatctacaat taaaacctga acattacanc tagggngaat taaaagancc421 ctntccagng aacanggagt tctngcctta ngagc<210>18<211>454<212>DNA<213>Homo sapiens<400>181 ttttttgacc taatgaaata ccaaatttct tggnaggctt actggagaca atttttttta61 aactagactt aattcctgtt ttaattattt aaaattatta aatcaaacca cctgataagg
121 acagantaaa tttaaatata ttaatgtaaa ntttatccaa ctattganta ttggnctgca181 aacacttatt catgaaagng ataaattttc ggttatcaga tgggcagaca gaccacatct241 aaaaccacta ctgaaatgat actttanctg tcaggantct gaggtgacan ggaaanttaa301 aggccggggt cgggtgggct catggtctnt aaancccaac actttggggg agggccaagg361 agggggaggg ctcgcttgag gggcaggggg gttttggggg acaagcnggg gggccaaaac421 agggggggca cttctggttt ttcttacaaa agga<210>19<211>478<212>DNA<213>Homo sapiens<400>191 gnngggnngg tatataagtg gttggtggat tttaacacac cagttattcc tcctgtgaat61 gcctcagaga tggtggaaac tgaggctgat gggggaacaa ctttagctag ggaaggangg121 tattggttct cacatttgac ccacactcct cctgcccggg gttnttcctc tctgcctccc181 ttccctacac tgctgctgtc ctctttttga catttctatt tatttatttt tctgccctct241 cagatcagca ggagagcacc taggtctcag ccagctctct atggcaggcc tctgcagcag301 cacttgtgaa gtgctagtga ttcacttcct tctctattgc tgacccttct ctttcttngg361 caggcagtcc aggggagtgg aaggaagggt cggagtataa gcagggaaag ccagggnact421 tcctcacatt ttcactngga agcacttctn cctcttattg gcaggncaag ttttttaa<210>20<211>386<212>DNA<213>Homo sapiens<400>201 ttttgtcttt taacctacac ctttatcatt actctaacag atttagggct tctctttctc61 tacagctaag taagggaata tgtgcaatta tgagacatac aaaaaaggaa agggaaagga121 ctttctaagt agcaaatctg tgccatgaag tagatgtggc gtgaagatac agagcctgag181 gatagtaatt ttccctgagc cacgcacaca ggcttttatt tcatgccttt tctctttctg241 tgccgtcacc tttgaggaaa aacgattgca ccttctccaa gtctggcctt tttaacagct301 acagttaagt tgggccaagg actttcccca gctctggaat ataggccctt ttgcccgact361 nccggncttt ttttgcggag gactgg<210>21<211>418<212>DNA<213>Homo sapiens
<400>211 ttttangttt ttaaagactc tgaactcatc atacaaattc aaatgcttta ttggagagta61 cctaatttga attaatcagc tagtctcttc ctagaagtct taaatacctt ttttatttta121 ttttatttta ttttattttt tttaagacag tctttctctg tcacccaggg ctggaggaca181 gtgggtgcaa tcatagctgc actgcagcct ctaattccng tgttcaatcg atccctccca241 ctcaggctcc taaggagcta gggnctacag ggngtgtgcc acccacaacc caacttaatt301 ctttgtagga gaagaggggg nctctctatg gttancccaa tctgggggcc ttnaggtttc361 tggggacttc aaggcnaatc ctcccggcct ccccaagtgg gggggggggg tttnaaaa<210>22<211>311<212>DNA<213>Homo sapiens<400>221 aaacctatca tgttaatatc ttaaaaaaag aaaaaataga ggaaaaaaat ctttaagaac61 acttaaccat gtttgttatt acatgttttt tgagaccaga ttttatgcca tctgtgttaa121 cccccccaca taagtctaca tcaagtattt ggttaaaagt gctcacagca ggaatttact181 gaggatttaa tgggaagtgt tctaaaactg ggattgttgt aatgcacaac tgcacaagtg241 gggtaaattt acaaaggaaa aatgtacaca taaaatgggg tgaactttat gggnatttta301 aaaattacag t<210>23<211>345<212>DNA<213>Homo sapiens<400>231 ctatattttt aagagatgta catcctagtc acttgtaatc ataagttgtc tttaactcag61 ctggctccag taattaaaaa aaaaattggg gagatgagga ttgtttaaaa caaatgttaa121 tgttttaaag ggaaataatt tccttacagt ttatttttat gtttctgctt tttagaaaga181 aacctttaaa aaggatatgt atttttgcca tattcaaaat aaggacttca gtatttatgg241 gctagggggg tattaaggaa aggtgggatc tgggcccggg cacgntggcc tccaagcncg301 ttaatccccg gcacttttgg gaaggcccag ggtgggccag atcca<210>24<211>320<212>DNA<213>Homo sapiens<400>24
1 tcttnnnctt cagttacata tggccaaatt taatggcact gaaaaggcta acatattaag61 atggaaaaaa tttacaagta atataaacat tataatctca attttgtttt taaaaataac121 ataactttac aggttaggat aaaatgcacc aaatttaaca gtgattacct gaaggtaaca181 gaatcatagg caatctttat tttttccttg cggtacattt ttatggatat atacaggtat241 tttttcataa tcngggaaaa aagggccttg gaaatttata atttttaaaa ggncctctcc301 ccncccaaat agggccccat<210>25<211>1303<212>DNA<213>Homo sapiens<400>251 ctcaacatgc caagagccag atgaatagtt cacacaatct gaatttaact gaaaaaaaga61 gatcatgctt tgaagcaggc tagactggag ctggacttta gtggataggt cggattctga121 aaaattagag cacattttag gaagagcaac agcacaaatt attatgttat caccatcttt181 gtctttttat ttccttagat ttacaagcac tagagattgt cctatgtgac cccgtttcct241 attttgctgg gaaaatggga gcaattagaa atctttcact tgtttccatc caccaccacc301 actactttgc ccattcccat tactctgcct tcctagtact ggaattggac tgtttctatt361 cctacttaaa gccagcctct ccatttgttc atcagatccc ttcttacggc ctatcaagtg421 ttgtttcagc aactctccct cttcagtctt ttttccctcc tgactgcatt cccattaatg481 tataaacatc ctcccattaa aaaaaaacaa aaactcaaat cctatattcc tttctaccta541 tagccccatt tctctgtttc tctttaagga aaactcttga gttttcgata ttcatatctc601 aattcctcgt ttttcattct cttttaaatc atgctgtgag gctccaaact catactgata661 aatctaattc tcatcttact tgaatcagca gcagcatttg acacatgatc aatccacttg721 aaatattttc ttcatttgac tctgaggaca catgcattca tagttctaat tcccatctgc781 attctacaca tcacagcagg tgcaccaaac cacacctgct gtggactaaa cagacttgga841 tggcagagca ggatccatag gcacacaagc taggtgtcac attcattgat taaatttttg901 gatttcactt atcaggctag gttgagttac agcaggtcca cagatagtgc tgttttgttc961 aatgttactt tgttataagg ctgatgagga aaaaaaggat tgccagacca gggctactgt1021 ctgtgtggag tttgcacgtt ctccccatat ctgtgtgggt ttctccaact attctgttct1081 tccacatcca aagatgtgca tgttaggtga actggtgtgt ctacatggtc ccagtatgag1141 tgtggatatg tgtgtgtgtg ccctgagatg ggatggcggc ctgtgcaggg ttggctcctg1201 ccttgtgccc tgagctgccg ggataagctc tggtcaccag tgaacctgaa ctggaataag1261 tgagtcaaga attatcttgt ttttattaat caatgtatgt aca<210>26<211>605<212>DNA<213>Homo sapiens<400>26
1 tcaaatttaa atttttagaa tgatggtgct actatattct aattgttaat ttgataattg61 taattgtaat ttgttaattt gttctaataa tattttcatg acattattca tttgctgctt121 ggcttgatgg cataacaagt atgggttatt taccctgaaa aggaaacagt atcacccttc181 cagaatctct atttacagct acagaagatg caacacacct gtttctggac cacttccaga241 tcagatgtga tgcatgcttt agatcattta ctttgatcta ggaattactg tctggcattt301 cgtggtgtag ggtaggtgtc tccagcactg tataggtgct ggaattgaga taagaatcaa361 tttcaataca aaatacattt tttgcttaaa nacgcaagga tntatgaatc aggtaaatna421 cagggtggaa aacaggaact aatgttatat atnctgtagt tactcataat gaanagaaag481 aaaccaaaaa gttccaaaat anatgaaaga ctgtattnct agaatgaaaa tatctttaga541 tgtgatagag ttcanacctn aaaagnatgt gagagcccat gnccccagat tgttccncca601 cccag<210>27<211>1232<212>DNA<213>Homo sapiens<400>271 gcaggtgatc cacccacctc tgcctcccaa agtgctggga ttacaggcat gagccaccgc61 gcctggccag aagaaatctt tatcttggtg tgcagtgtct ggtgagggac aaatatcatc121 tctcttggat ctgaatctgg aagaattcgc ggcactgaag ggattttttt ttttcagaca181 gtatccctct gtcgccaggc tggagtgcag tgccgcaatc tcagctcacg gcaccctctg241 cctcccgggc tcaagggatc ctcctgcctc agcctctcga gtagctggga ctacaggcac301 gcgtcaccag gcccagctaa tttttgtatt tttagtagag acagggtttt accacggtgg361 ccaggatggt ctcaatctct tgacatcatg atccacctgc cttggcctcc caaagggctg421 ggattacagg cgtgagccac cacgcccggc ctcactgaag ggattttttt aatgtcacgt481 ggctctcaca ggtgcggtgt gtttgggtgc aagtgaagat tacgactgat gcttaaaaac541 aaatgtaaaa ttccaggtgg ttgttgctat ggggagcagc tttaggacaa tctgagtggt601 ttcagttgca agagtgtgcg tgtacgtcca agtgctacag tcaagattca actgctggct661 ttgagggcct ctttaagaac agtaatgata acctaaggca gtttaacagt atggaatggt721 taccttttag aagttaagct atgggcatgg aagttcaatc agtgcattga agtttttcct781 ttatctctcc tatggttaat ggtttctgca gaaaaggacc aattgatttc tttctaaaac841 gttgcttcag ggtgtagaga cctttatagg tcatgtttca acttacagaa aatttttata901 gttcaaatat aaattaagtt caatgtggaa tttgtaatag aatttcaggt gaagtaaaat961 ttccactttc cttaggctgt ttgcagtgcc cagcaggccc catgatatcg agatggaagt1021 tctgttaaag gaggagattg ctcagggatg ggcagaataa ggaatatggg cagctcaggc1081 taatgataca 
atgattgaga tgtagaaaga gggtcaggca cgggataacg cctgtaatcc1141 cactggcttg ggaggcccag acaagagaat cgattgaggt cagaccagcc tggtcaacag1201 agtgagacct aacctgtaca ataaaaaaaa aa<210>28<211>796
<212>DNA<213>Homo sapiens<400>281 gntgggaaaa gtgtactaca tgaaggatgc ttggggcttt gctaacctgc tcaccgctaa61 ccctccaagt ctaaccatcc ccagaggcca cacaaaccaa gtgactcctg tagttcccat121 ttgccccact atggagtcag ggnaaaagtg gaatagccac tccatgtgat tttagcatgt181 ttaatcattt taacaataaa ccaccccaca aatggggtca tttancctct aaaaaaaaaa241 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaannnnn301 nnnttttttt tnnnggnnnt ttnccggggg aaatttaana anccccncnn nnnnnnnnnn361 nnnnnnnntt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnttnnnnnn ntnnnnnnnn421 ncnnnnnnnn nnnnnnnnnn nnnnntnntn nnnnnnnnnn nnnnnncnct ttnnnntnnn481 nnnnnntnnn nnnnnnnnnn ntnnntnnnn nnnnnnnnnn tnnnnnnnnn tnnnnnnnnn541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnntnnnn ncnnnnnnnn601 nnnnnnnnnn nnnnnnnnnn nnnnnnntnn nnnnnnnntn nnnnnnnnnn nntnnnnnnn661 nnnnnnnnnn nntnnnntnn nnnnnntcnn nnnnnnntnn nnnnnnnnnn nnnnnnnnnn721 nnnnnnnnnn nnnnnntntn ntnnnnnnnn ntnntnnnnn nnttnnnnnn nnnnnnnnnn781 nnnnnnnnnn nntnta<210>29<211>453<212>DNA<213>Homo sapiens<400>291 tgtgaatagt aattataatt aaccctttct aaaagaaaag gtatatgtaa tagtacataa61 tagagtaaca attgtaactc aagtcttttt ttttcctttt tactcaaagc ataaaccaga121 cacaatacgt tataccagat taacatgcaa cttaaaaaga ggttaatgta catactaagc181 atggaaattc accagtaaaa tcttatttgt cattttaccc ccactacctt gacattattc241 ttccatttca taatcaacaa ggttctactt ctggtcacct gataaatgcc ttgcattcct301 ggagcttgac atttcagtca attatnggac acaagaacaa tgttacaagc ataaagaaaa361 aatgcatcta cgaaacagct ggactgcact gctttcaatg gaaaggtccc cactccaacc421 tgcccactcc ctcccttttc tcagcacaaa cat<210>30<211>611<212>DNA<213>Homo sapiens<400>301 tttttcggtt ttgctacact ttaatgggtt ttttttttaa gggatttttt ttcaggtctt61 gtcagcaaca tcaaacaaaa ggtactgagt actccacagg gtacagagtg ctgccaaaca
121 ccttagaaaa attacatgac acggagaaaa tgcgcctctt gctccttgaa gagcttacag181 tctagggatt tgacaactca cagtcttagg aactgggcaa agtaaggcaa attcttcatc241 ccctagagct attgtggact gaatcatttt agaatttgga attaatccaa tcaagatgag301 agacaagact aaatttggct gagaattcat tcaggctccg catagttttt attaacatcc361 gtctagtaaa cagaatggac ctaacagaca actggaaagt aaagactaga tcccttgaag421 tgccagggct acaacaacct taattgtggg ttacttattt ttaaaaagca aacataccga481 atggtatgac tagggtgatt acccagttta aaaataggcc aggtactgac actggcattc541 ccctcatgca ttgcccattt aaaatagngg atattaaata tgtgggctta aacncccggg601 ccgaatctgg c<210>31<211>579<212>DNA<213>Homo sapiens<400>311 aaatatttaa attttattct taaatactat ttattaatac agagaaatac tcctgctata61 attttaagtg acacaatagt acaacaattg ttaagaacaa tccacctgtg tcaaagagac121 acatacacaa acacacatcc atatacacat acaagcttcg gagaaaattc tgaaaggatc181 tcattgtcct agatggtggc attatgggca attattttgc aaatgtcacc cagtgaagaa241 gcaatggtgc ctttggaaag tatgtagctc ttagttaagc ttttatctgg aaggtgctgg301 agaaggtctg ggcacagagc tggttcggct ccatnggttt tgtgacctgt gccatcacat361 aggaccccat gttcagaagg accccactct tgggtttaat gctctgctgc cgtgtcttga421 aattcttaat aatttttttg aacaagggga tgctgcattt tcactgtgtt ctggggccta481 atgattatgt agtcaatctt gnctggcagc tctnctctgg cccatcactg gtagatgatt541 gtgggggtgg ttantccttt aaaatttcca tttaaaaaa<210>32<211>609<212>DNA<213>Homo sapiens<400>321 cccatttgga acaagtgccc atcaaaagct taggagatat aaaagatgta caagaacaac61 tccaagatcc cttttaagag tctttcatct ttctgtaaaa actaaaccaa gatacataac121 atatagagtg tcacatatcg catgctaatt cagaagcaag ctatatgaag caataggaaa181 cacagagccg tcagagcggg aaagccctgg aggcacgagt cagcgatgac accacagggt241 attcaaggag caccaatggc ctgcaggtgg cagaaacaat ggaggaacca gcctgagtca301 aggaggactc actaggtaag cttgggaaat agggctgaag acgtggactg agcagtcgat361 gggaaggctc ggaagtcaga catggctcac ctgcgacagg gggcctcagc aagagacagg421 tatgagggac actgttttag gcccatggga ttaaaaacac ggctgccatc aacatttgat481 ttcagcctat gtaaatggtc cggtgcaccn gtttattctc aacaagaact ttaatccagg
541 gtggctccta ccacctcnat tcctatcatc aaaatgggtc aaatttggcn aagccccgnt601 gtttacata<210>33<211>723<212>DNA<213>Homo sapiens<400>331 tcggaggtaa aaatttaatc cccgttttac aggtgagtga agtgggataa aatggcataa61 ttcactcaag atcataagct attgagtagc aaagacaaga gttaagcata aagtactcaa121 gaacctgtgt tttacattct atctgaagtg caaagaagaa tacctgaatg tcaaatgcag181 ccagaactct gggataagac ccgaggaaca agtcctgcag cagcagtgct aggaggcaga241 agaggaccat gggttcacat gctcaggaag agccatcagt aacaacaatc aaccacctac301 tgagactctg gccatgggtc tccccggcta taggggtaag gtcggaggtt taaataaatg361 gcctctggcc attccactgg gcaatacaat ctaagggaag ctggccaaac tttactggaa421 cccttggcat gtatctgtat aatgcaaatg tcattataca tttacctcac tttggtatat481 atgtatacat tatacatatc cctcacnttg tcaggaattt aaatgtgtaa aaatttacag541 gctgggcatt gtgggctcac acgntgtaat cccatttccc gacnaacggg gttggataaa601 tgccggatan tccatataaa gggctttcnc ctggacnctt gggttcaagg cnggtttaac661 cccggagggg gattaccggt ttccgttttc ccagantttt nttcccataa tctantgctn721 ctt<210>34<211>648<212>DNA<213>Homo sapiens<400>341 ttttataaaa gcttgttttt ctttattaga atactttttt caattctgat ttgtcacaat61 ttagattctt tttctaagaa taagcagaaa tttacaaaat ttaattttta tttatacatt121 catccgttca atacacattt caagaaagct gtattgcacc ctttcagtgg tagttacagg181 acaaagaaac aaaataatcc aagagagaga ccaacaaatg tatatttata acacagagta241 ataaacacaa ataaatgtgg agttatttaa gcatgtaaga tggtacatgc tctaccaggt301 atgggggctt ctctaagaca caagatcaga ttaaagtctt gaaagataat ctggggttag361 tcaagcaggt agaagtgtga agagcacttt ctgagcggaa tctccatgtg ccaagtctag421 ttcaagagac tggatagaga ttagcaattt ggagaatgca atattaagaa atggcaaagg481 tgagcctaca gagattccaa agttgcatag tgaggtttgt gtttgagaaa gtggccncag541 gcagccccca aaagagggtt aatagaaagg agctgaggtg gaaatcaggc ttgtgttcan601 cagtgtggca nattgggagc atgggagatc tgnctttnaa ggncaagg
<210>35<211>466<212>DNA<213>Homo sapiens<400>351 gtgagccagg ctacatgcca ggctttatgc tactgtgttt atttctcttt gttacttgcc61 agcacagtga tactagaact tgcagaaatt taagggtaac cggctattac tttaaaatac121 cttttcccag gctcccactg cagaccttca aaatccatgt cccccaagat tctgtatatt181 taaactggtg tccagagcag tttctatgac ctcaaaaggt tgggaaacat tgccctcatg241 ttcagaaaag tcctttaaac ttttcagctt tgacttagtt gtggagagat gttgaaaaat301 gccttttgtc atcccattgc ctgtacctca gatatgaagg aagaaatgac ccaaggaggc361 ttcgatggga gccaaaatca tcagagggga gagtctcggg agaaagatga atctccaggg421 caggggccaa gggtttggtg ggtccccccg gctcctggcc cctgca<210>36<211>487<212>DNA<213>Homo sapiens<400>361 tntcaagatg caaaatngtt cgataagctt gagtctctcc cactctnggc tgcaaaaatt61 aaaaagggct acaacaaaat tatagactgg aaatttgtga ccaaatacag cacaggattc121 ttagactgca aggaagacag cttcagagaa aggaacaagg agagtttact tacacttgtt181 tataccccta aggagggctt ttttgtacat atttgttggt aatattttga agatgtgaag241 tcaatgcact ttaatccgct ttggtatcat ggtcccttcc agccttccac ctctgaggct301 gagccatnga ggcctctttg ccccagggga cagtaggaca gttctgcctc cagcaaaggt361 gtgatggatc agctttactc acaaagctga atttctggag cacgcagcaa gatcatgatt421 ttgaaagcca ggctttggga ganggaaatg tgttttcnaa gtttctttag cttccatgaa481 atataaa<210>37<211>461<212>DNA<213>Homo sapiens<400>371 ttttaagagc gccanataga tttaaagtcc tttngttaat ttaggaactt aaaaccgtgt61 gatgtatagt ttggtttgct cgtaaaatta ctaagtgtgg catatttagt gctgtgaaag121 cttggggtaa aaggaaaaaa atcagtgaga attttcttac aagtctttag gggaaaatgt181 atttttccgg ctttttccca cacatgccag gccctccgca nctggggttg gcgctggtcg241 ttaccctcag ccccaattac agctgtgctg gaagctggga gtgggtgctg gggctggggg
301 attgcgggag tagggcnctg gtggggtttt ggtaaaacca ggtncaggnc ttatctccag361 agggggancc cancccattc tgggagctca nccctgncgg aaacccaggc cctnaagant421 cgggcagtnc atctctgntc cctcccacag tgagacaagg g<210>38<211>544<212>DNA<213>Homo sapiens<400>381 tttttttttt cacgtgaaaa aaataattta ttacagactc ttttacacat taacatggaa61 catttataca tatatcgatg tgctgatatg aaatactaaa tttaaaggca aacattttta121 cacaaaagta gttgcactct attttataaa gatagatatt aataagttat cagagacatt181 taagagctag aggccaatta ttccaacagt aatgcattct atgctgaaag taaactaagt241 tttctgaaca tgatgtcctg gatataatca cattcttcta agctaaggaa agggagctca301 tttctgggaa tacaaggcca agaagggctc taacagcagt atcccagcag tgtgtttcca361 gatttattct tggatgtggt tggagcgccc aacatttagc ctgaactaat gtaacagctc421 aatgtgaaac aatggcagct ttctggtaan agctgcctgt gggttaatga gatttaatac481 agggggatac agttaccaat gatagncttt taggaggaat tataattggc catatgattt541 ggaa<210>39<211>481<212>DNA<213>Homo sapiens<400>391 atttatatat acctcttcna ccattcattg gctggagtag ttgaacagac atcagcaggc61 acaatgctag gcaggcactg gagattcagt gatgaataaa acaaaactga cctggaacct121 ggcctcctga agcttctctt catctgtcta aattcctttc tttcaaggcc cagtgcaagt181 gacaccttca tcaggaaggg cttccacatg attctatctg caaatctccc ttttctgaac241 tattatgctt atagtctgta ccaggccaca tgacatatgt tattttgtgg cattctagaa301 ttattccaat cttttcattc ttataattgg tgtgttgtat ncaataaata atgttgttat361 tccagttttc atcactggtc acgcaagagt cctctcttna ttatatttta tccccatagt421 tcaccccgtt tcctccccat cctaagtaac tatgaccata cctaaatata natctaatta481 g<210>40<211>325<212>DNA<213>Homo sapiens
<400>401 aacttcaact tgttcacaat agctgagagt tactagattt taatctagag ttactggatt61 ttaatctcta gagttaacta gatttaaaac tagagttacc actgcccatt ttataaaccc121 ttaggtgata atgagaggca caaaggaatg ctgcagaaag acacatccat ggaggctgag181 ttttgcaaat aaaacctgct tcaacatgga accctccagt agcctttatg taaatgtagg241 acttaagtgg aaagagaagg aggataaata ttctagcagg tgctgcagaa tgcactgaat301 aggctgaaat gacacacgtt ttttt<210>41<211>418<212>DNA<213>Homo sapiens<400>411 taaatagttg tgaacataac tattgtgaat actatgttat atgttatagt tgtgaatgta61 aatagtattc acaattgtga atacaatgta aatagctttc acaattgtga atacaatgta121 agtagctatg taaatagtga atacaatgta aatagctatg taaatagttg ttatgctgat181 tagggaataa tgagaaggaa naaaagtata catattcagt acagatgcag tcatcttttt241 ttcccctcga acatttttga tccacaattg gttgaatcca cagatgggga acacacagtc301 ttggtaaatt taaccaacaa ggagggtaaa cgcatcccaa cagggaaggt aaactgncac361 atccatgcag tacctcttgg aggggcatca ctggtttata ggcttcaatt acagtgga<210>42<211>546<212>DNA<213>Homo sapiens<400>421 tttttttttt ctttaatttt taaaataaac ttttattttc aaacaatttt agacttacaa61 aagagttgca aagataatac agagtaattg gttttatagc atcaatttaa gcctgttctg121 atcttcttga tctattttat cactttagga taatgaaaaa taagttcagg tagaggagct181 aggctgggct tggtggctca cacctgtaat cccagcactt tgggaggcag aggtgggtgg241 attgtttgag cccagggaat ttgagaccag cctggggcaa cataggcgaa aacccatctt301 ctacaaaaaa ntacaaaaat taggccgggt ntgggtgggc acattacctg tgggtttcca361 gcttacttaa gggagggctt gagggcaggg aggaacttgc ctggaggcac aaggaggttc421 atgggctngc agttgaggcc ntgatttgtn gccantggca nttccagncc ggggtngaca481 gantgagant ttgtctnaaa aaantaaaaa accaaacaaa ncacacacac acttncacac541 aaaccc<210>43
<211>580<212>DNA<213>Homo sapiens<400>431 tttttttttt gatttatgaa aaataaattt attacataac accttgtttt taacaatgtt61 caattatttt ttattcagaa tacttgtgtc attcatgtaa aatagtttga tacagtacat121 tgtatgttat aggttttttt ttcctcaaaa attctaaggc tctcctactc cttctccttt181 gctgaagtaa gggcacccta aacaacgtgc gaacaagttt tcaaaaggga aaattaactg241 gaacggaaat tttaataggc actacaccaa ccagctttaa aatcagggac gaaaatattt301 gaattgggat ggccggcacc tttttaaggg aagttattca ttcatggctt tttttcatct361 ttaaaggcat ggcttcacaa atcaggcaac accaacgggg aaaaattaaa acactgtcnt421 atgggacaat ccntttaggg ttgttttggn tggaacccaa cggggttttt atncatggac481 aacntttttg attactttcc ataggcggnt ttaatacccg tcagggcnag ggtttggaat541 anaacaactt tagtgggggg ttntctgggg gggttttgaa<210>44<211>319<212>DNA<213>Homo sapiens<400>441 gtgtgaatta acaacgttgg cctctcggat tcccaaaact ttaaaagatt aataatatcc61 agtattggtt gggattataa gcagatatga tatttcactg atagagacca gcatgactgt121 ctcaattgtt aaactattga aatgcatctg tgtcaatggt aaatagtatt attgtaagta181 caacactaaa attaataaat tgtgtaaatt ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa241 aaaaaaantt cctgcggccg caggcttttt ccctttggtg gggggttaat tttgggcttg301 ggcnctgggc cgtcgtttt<210>45<211>444<212>DNA<213>Homo sapiens<400>451 aaagattccc nnnnttttta tttctggaac tgtacaccag gtattaaagt acaacaaata61 caaaataatg ctcaaaaaaa tcagtgttta tgtacaaata ttaaaatcaa tgcaatagca121 acttcctctt gaacatctga tactgaaact tgttctgatt gtcactaatt tatctcatac181 tcacagggta cttatttcaa actgggacaa ttggaatcac tggtcatcta agaggaatat241 taatatctac catatttaac aataaaacta atagtttcct ccatttagtg aaaaaattag301 ggaacttttt aaaaaacata ggtaacgtca atattttatt aaattatttc aatttccntt361 tgtgggccat gtgctttgga aactcggggc aaacattctc cntgggnggt cgggttnttt
421 gttaggttac ttcnagttgg attt<210>46<211>407<212>DNA<213>Homo sapiens<400>461 aatacaaana tngtctcaca tacttgaact cctgtatgta agcaatcctc ctgctttggc61 ctcccaaagc gctgggatta taggcatgaa ccaccacgcc cagcctatta ctgaaatttt121 aatgatgatg acaacctcat gacaatccca aagtactagt atatccattt tactgatgag181 aaaagcacag ctaatatgca gtaaatctag tttctgaatt cagggcattc ttattccaga241 gcccatgcta ttaattccta tactatactg tctctatata aggcatgtag gcatattaat301 ctttttaagg ggggggagta ttaggcaggg cccacanggg agcaggtagc tgcctaatgt361 ggtaaaggtg gacagggccg ggggggnaaa ccnggaaaaa tctactg<210>47<211>400<212>DNA<213>Homo sapiens<400>471 ctttcttttt gcccaggctg cagtgcaatg gtgcgatctt ggctcgctgc aacctccgcc61 tgcaacctcc gcctcccggg ttcaagtgat tctcttgcct cagcctggcg agtagctagg121 tttggccaaa atatgaacta gaaaggcagg aacctggcca aaggtgggga gcttgaatgt181 tattacctta ttcctgcttg acctttactg ccaacgccaa cacaaatggt cagaggtccc241 ttatatgaaa atcttcatga tcttgtggga aaaccctgat ttttgtaaag gctgcaaaac301 aggctcaaaa ataaataaat aaataaaatt acatttaaaa gagccagttg cagtggctca361 cgacgttttt tttttttaaa ttaaaatgtt tcccatattt<210>48<211>354<212>DNA<213>Homo sapiens<400>481 taatttngta atttacattt tattctgtct gctcanctat tttccagttt ttctacagng61 tatacattat ttaccaaatg tatattttaa gatattttaa aatgttcact ctttgattca121 gtcattctac ttcttagaaa catcttaaag naataataca agaagtatgc aaagatttag181 ccacaggaat atttatncca aaaacgttta cactaccatt aaattagaaa accaaaatgc241 ataaagnggn gtataaagtt ggnctctaaa gtcattaaaa atntttaaca aaatgcttac
301 agggcattct tatgtggggg agagaaagct tacaaagcat tctgtacaat ataa<210>49<211>434<212>DNA<213>Homo sapiens<400>491 ggcttttctt accgaattgg ttaggattgt cagtacaagt gttgaattaa ggtggtaatt61 gtaggtaaat gtgtttgttc tcaatactgg gacaaagttt aaaatatctt gtttttgagt121 cttggttgaa tttttttctt aaattttaaa aggatgttga attttatgaa gacatttgta181 ccgcatctac caaaatgatg tattagttac ctattgctgc ataacaaatt acctaaacat241 tttgttggtg aggatatgaa gaactggggt atcttcaaat gctgctgggt agggaatgtg301 gaaatgggtg taggccactt tgggaaaagc ttgtcatttc cntcaaaatg tttaaacata361 cagttactgt gttcctaggc acttcttctc ccggggntat atanggcgga aaggaaatgg421 aaaaccattc attt<210>50<211>493<212>DNA<213>Homo sapiens<400>501 gctatcacat ttttntctat tcttaagcgt ttttctgcca tgatcacatt gtgatgaaga61 acatgatggt cactagtagg taactttctg tgtcattgcc ttaatctcag tgaggtgcta121 gtggatttac ctacccctgc ttttgcatca ccactgtaaa tctaatagtg aaaaggcaaa181 tgatgtctca gtatcactgt gaaaacattt ttcccttgga ccagctgaaa gcatcttgag241 gagcctgaag gcttcaaggt ccacacgtca aaaaaacaca gccctaggac tgatgggtgg301 cccattatgg gtgggggtca gcttccccct aggatcacat ggggttctgt tggggagggg361 gtgggcttgt ctgcatgacc atcttaggtc acctcaaggg ggggaagggg gcaaagggtt421 tgcccaaggc ntctacttgc gtaagtagga tagcttttaa cccttggggn aagggaagtt481 ggggtttggg tnc
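The listing above is flattened: each record runs `<210>` (SEQ ID number), `<211>` (length), `<212>` (molecule type), `<213>` (organism), `<400>` (sequence), with position counters fused into the bases. The following is a minimal sketch, not part of the patent, of how such text might be split back into records and each declared length checked against the bases actually present. The function name `parse_listing`, the dictionary keys, and the toy input are hypothetical. The sketch assumes that every letter after `<400>` is a base and that no Latin-alphabet text follows the final record.

```python
import re

# One record: <210>id <211>length <212>type <213>organism <400>body,
# where body runs up to the next <210> or the end of the text.
RECORD = re.compile(
    r"<210>(\d+)<211>(\d+)<212>(\w+)<213>([^<]+)<400>(.*?)(?=<210>|\Z)",
    re.DOTALL,
)


def parse_listing(text):
    """Split a flattened sequence listing into per-sequence records."""
    records = []
    for seq_id, length, mol_type, organism, body in RECORD.findall(text):
        # Drop the repeated SEQ ID, position counters, spaces and any
        # non-Latin characters; only lowercase letters are kept as bases.
        bases = re.sub(r"[^a-z]", "", body.lower())
        records.append(
            {
                "seq_id": int(seq_id),
                "declared_length": int(length),
                "type": mol_type,
                "organism": organism.strip(),
                "sequence": bases,
                "length_ok": len(bases) == int(length),
            }
        )
    return records


# Toy input in the same flattened layout (made-up bases, not from the patent).
toy = (
    "<210>1<211>12<212>DNA<213>Homo sapiens<400>11 acgtacgtac gt"
    "<210>2<211>5<212>DNA<213>Homo sapiens<400>21 ttnaa"
)
records = parse_listing(toy)
```

A record whose `length_ok` is false points to bases lost or duplicated during extraction, which is worth checking before any sequence comparison.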
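Claims
1. A class of isolated sequences of expressed sequence tags expressed in human liver, characterized in that it comprises: (a) the sequences shown in SEQ ID No.1 to SEQ ID No.50; (b) the complementary sequence of each of the sequences shown in SEQ ID No.1 to SEQ ID No.50; (c) sequences having at least 70% homology with each of the sequences shown in SEQ ID No.1 to SEQ ID No.50; and (d) a combination of one or more of (a) to (c) above.
2. The class of isolated sequences of expressed sequence tags expressed in human liver according to claim 1, characterized in that the sequences comprise the sequences shown in SEQ ID No.1 to SEQ ID No.50.
3. A probe molecule, characterized in that the probe molecule contains about 8-100 contiguous nucleotides of a sequence as described in claim 1.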
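The operations named in the claims (complementary sequence, percent homology, probes of about 8-100 contiguous nucleotides) can be illustrated with a short sketch. It is not the applicants' method. The function names are hypothetical, the complement is returned as a reverse complement (the opposite strand read 5' to 3'), "homology" is approximated as ungapped identity between equal-length strings, and the claim's "about 8-100" is treated as a strict bound. The claims themselves do not specify a homology algorithm.

```python
# Lowercase IUPAC subset used in the listing: a, c, g, t and n (unknown base).
COMPLEMENT = str.maketrans("acgtn", "tgcan")


def reverse_complement(seq):
    """Return the complementary strand, read 5' to 3'."""
    return seq.lower().translate(COMPLEMENT)[::-1]


def probe_windows(seq, length):
    """List every contiguous window of the given length (8 to 100 nt)."""
    if not 8 <= length <= 100:
        raise ValueError("probe length must be between 8 and 100")
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def percent_identity(a, b):
    """Ungapped identity between two equal-length sequences, in percent.

    Positions where either base is 'n' are counted as mismatches.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y and x != "n" for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# First ten bases of SEQ ID No.40 as printed in the listing above.
fragment = "aacttcaact"
rc = reverse_complement(fragment)
windows = probe_windows(fragment, 8)
identity = percent_identity("acgtacgtac", "acgtacgtaa")
```

In practice a gapped aligner would be needed to compare sequences of different lengths; the identity function here only shows what a 70% threshold means position by position.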
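Abstract
The invention discloses a class of sequences of expressed sequence tags expressed in human liver. Using these expressed sequence tags, the tags that are specifically expressed in human liver can be readily identified, and genes associated with human liver diseases can in turn be found. The tags can thus play an important role in studying the pathogenic mechanisms of liver diseases and in developing drugs to treat them.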
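Document No.: C12Q1/68 GK1955288 SQ200510030820
Publication date: May 2, 2007. Filing date: October 28, 2005. Priority date: October 28, 2005.
Inventors: Han Zeguang (韓澤廣), Huang Jian (黃健). Applicant: Shanghai Human Genome Research Center (上海人類基因組研究中心)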