Data Management and UtilizationÊý¾Ý¹ÜÀíÓëÀûÓûùÓÚTCGAÊý¾ÝÍÚ¾òµÄÈéÏÙ°©Ô¤ºóÏà¹ØmRNAÑо¿·¶ÈÙ¢Ù Ò¶Ã÷È«¢Ú£ª ½¯ÓÀ³¬¢Ù ÐÜÔÂÁá¢Ù¡¾ÕªÒª¡¿ Ä¿µÄ£ºÑ°ÕÒ¿É×÷ΪÈéÏÙ°©ÉúÎï±êÖ¾ÎïµÄmRNA¡£·½·¨£ºÀûÓð©Ö¢»ùÒò×éͼÆ׺ͻùÒò±í´ïÊý¾Ý¿âµÄת¼×é²âÐòÊý¾Ý¼¯½øÐвîÒì·ÖÎöºÍ¹¦Äܸ»¼¯·ÖÎö£¬¹¹½¨µ°°×Ï໥×÷Óã¨Protein Protein Interaction£¬PPI£©ÍøÂ磬Ȼºó²ÉÓÃLasso»Ø¹é¹¹½¨·çÏÕÔ¤ºóÄ£ÐÍ£¬ÒÔ¼ø±ð¸ß·çÏպ͵ͷçÏÕ²¡Àý¡£½á¹û£º¹²¼ø¶¨³ö176¸öÉϵ÷mRNAºÍ562¸öϵ÷mRNA£»PPIÍøÂçÖ÷ÒªÓÉ6¸öÄ£¿é×é³É£»·çÏÕÔ¤ºóÄ£ÐÍÓÉATP5BµÈ16¸ömRNA×é³É£»Ô¤²âѵÁ·¼¯ºÍÑéÖ¤¼¯µÄÒ»Äê¡¢ÈýÄêºÍÎåÄêROCÇúÏßÏÂÃæ»ý·Ö±ðΪ0.820¡¢0.792¡¢0.747ºÍ0.741¡¢0.747¡¢0.699¡£½áÂÛ£ºÑо¿³É¹ûΪÈéÏÙ°©Õï¶ÏºÍÔ¤ºóÌṩºòÑ¡ÉúÎï±êÖ¾ÎҲΪ½øÒ»²½²ûÃ÷ÈéÏÙ°©·Ö×Ó²¡Àí»úÖÆÌṩÉúÎïÐÅϢѧ»ù´¡¡£¡¾¹Ø¼ü´Ê¡¿ mRNA TCGA ÈéÏÙ°© PPI Ô¤ºóDoi:10.3969/j.issn.1673-7571.2021.01.024¡¾ÖÐͼ·ÖÀàºÅ¡¿ R319 ¡¾ÎÄÏ×±êʶÂë¡¿ APrognosis-related mRNA of Breast Cancer Based on TCGA Data Mining / FAN Rong, YE Ming-quan, JIANG Yong-chao, et al.Abstract: Objective: Looking for mRNA that can be used as breast cancer biomarkers. Methods: To use the transcriptome sequencing data set of the Cancer Genome Atlas and the Gene Expression Omnibus for differential analysis and functional enrichment analysis, construct a protein interaction (PPI) network, and then use Lasso regression to construct a risk prognosis model to identify high-risk and low-risk cases. Results: A total of 176 up-regulated mRNAs and 562 down-regulated mRNA were identi?ed. The PPI network is mainly composed of 6 modules. The risk prognosis model consists of 16 mRNA including ATP5B. The area under the receiver operating characteristic (ROC) curve for the training set and the validation set for one, three and five years are 0.820, 0.792, 0.747 and 0.741, 0.747, 0.699 respectively. Conclusion: The research results provide candidate biomarkers for the diagnosis and prognosis of breast cancer, and provide a bioinformatics basis for further elucidating the molecular pathology of breast cancer.Keywords: mRNA, TCGA, breast cancer, PPI, prognosisFund project: General Project of National Natural Science Foundation of China (No. 61672386); General Project of Natural Science Foundation of Anhui Province (No. 1708085MF142); Research Planning Fund of Humanities and Social Sciences of Ministry of Education (No. 16YJAZH071)Corresponding author: School of Medical Information, Wannan Medical College, Wuhu 241002, Anhui Province, P.R.C.»ù½ðÏîÄ¿£º¹ú¼Ò×ÔÈ»¿Æѧ»ù½ðÃæÉÏÏîÄ¿£¨±àºÅ£º61672386£©£»°²»ÕÊ¡×ÔÈ»¿Æѧ»ù½ðÃæÉÏÏîÄ¿£¨±àºÅ£º1708085MF142£©£»½ÌÓý²¿ÈËÎÄÉç»á¿ÆѧÑо¿¹æ»®»ù½ð£¨±àºÅ£º16YJAZH071£©£ªÍ¨ÐÅ×÷ÕߣºÍîÄÏҽѧԺҽѧÐÅϢѧԺ£¬241002£¬°²»ÕÊ¡ÎߺþÊи߽ÌÔ°ÎIJýÎ÷·22ºÅ¢ÙÍîÄÏҽѧԺҽѧÐÅϢѧԺ£¬241002£¬°²»ÕÊ¡ÎߺþÊи߽ÌÔ°ÎIJýÎ÷·22ºÅ¢ÚÍîÄÏҽѧԺ½¡¿µ´óÊý¾ÝÍÚ¾òÓëÓ¦ÓÃÑо¿ÖÐÐÄ£¬241002£¬°²»ÕÊ¡ÎߺþÊи߽ÌÔ°ÎIJýÎ÷·22ºÅ¡¶ÖйúÊý×Öҽѧ¡·2021µÚ16¾í µÚ1ÆÚ ¡¤107Êý¾Ý¹ÜÀíÓëÀûÓÃData Management and UtilizationÈéÏÙ°©ÊÇÅ®ÐÔ×î³£¼ûµÄ¶ñÐÔÖ×1.2 Ñо¿·½·¨ ÁöÖ®Ò»[1-2]¡£2019Äê¹À¼ÆÓÐ268 6001.2.1 ÈéÏÙ°©Êý¾Ý¼¯²îÒì·ÖÎö ²ÉÓÃÀýз¢²¡ÀýºÍ41 760ÀýËÀÍö²¡Àý£¬RÓïÑÔedgRÊý¾Ý°ü²îÒì·ÖÎöTCGA-Õ¼ËùÓÐз¢°©Ö¢²¡ÀýµÄ30%ºÍ°©Ö¢BRCA£¬ÉèÖÃɸѡ±ê×¼P<0.001£¬Ïà¹ØËÀÍöµÄ15%[3]¡£Ä¿Ç°ÈéÏÙ°©ÁÙ|log2FC|>1£»ÀûÓÃGEO2R·½·¨²îÒì´²ÖÎÁƵõ½¸ÄÉÆ£¬µ«ÒòÆäÔ¤ºó²îºÍ·ÖÎöGSE139038£¬ÉèÖÃɸѡ±ê×¼¸ßËÀÍöÂÊ£¬ÈéÏÙ°©ÈÔÑÏÖØÍþвŮÐÔP<0.01£¬|log2FC|>1£»¶ÔÉÏÊöÁ½ÖÖ½¡¿µ£¬²¡Çé¶ñ»¯»¹»áÓ°Ï컼ÕßÉúÃü²îÒì·ÖÎö½á¹û½øÐн»¼¯ÔËËã¡£°²È«¡£Ëæ×ÅÖ×Áö»ùÒò×éѧµÄ¿ìËÙ·¢1.2.2 ²îÒì±í´ï»ùÒòµÄ¹¦Äܸ»¼¯·ÖÕ¹£¬·¢Ïָ߶ÈÌØÒìÐÔµÄÉúÎï±êÖ¾ÎïÎö Õë¶Ô¹²Í¬Éϵ÷mRNAºÍ¹²Í¬ÏÂÒѳÉΪÈéÏÙ°©ÁÙ´²Õï¶ÏºÍÖÎÁƵÄÄÚµ÷mRNA½øÐй¦Äܸ»¼¯·ÖÎö£¬°üÔÚÐèÇó[4]¡£À¨ÉúÎï½ø³Ì¡¢·Ö×Ó¹¦ÄܺÍϸ°û³É°©Ö¢»ùÒò×éͼÆ×£¨The Cancer ·Ö¡£ÀûÓÃR°üclusterProfiler½øÐй¦Genome Atlas£¬TCGA[5]£©ºÍ»ùÄܸ»¼¯·ÖÎö£¬P<0.05ÓÐͳ¼Æѧ²îÒò±í´ïÊý¾Ý¿â£¨Gene Expression Ò죬ÿ¸öÏîÄ¿Ñ¡È¡Ç°10¸öÀàÄ¿½øOmnibus£¬GEO£©×÷Ϊ×îȨÍþµÄ°©ÐÐ×÷ͼ¡£Ö¢»ùÒò×éÊý¾Ý¿â£¬Îª´ó¹æÄ£Ö×ÁöÑÐ1.2.3 µ°°×»¥×÷ÍøÂçºÍMCODE·ÖÎö ¾¿Ìṩ·á¸»µÄÔʼÊý¾Ý¡£±¾Ñо¿Àû²ÉÓÃMetascapeÔÚÏßÊý¾Ý¿â£¨http://ÓÃTCGAºÍGEOÖÐÈéÏÙ°©Ïà¹ØÊý¾Ýmetas cape.org/gp/index.html#/main/¼¯½øÐвîÒì·ÖÎö£¬ÒÔʶ±ð¹²Í¬µÄ²îstep1£©¶Ô176¸ö¹²Í¬Éϵ÷mRNA½øÒì±í´ï»ùÒò£¨Di?erentially Expressed Ðе°°×ÖÊÏ໥×÷Óã¨Protein-Protein Genes£¬DEG£©£¬²¢¶Ô²îÒì±í´ï»ùÒòInteraction£¬PPI£©ÍøÂç·ÖÎö¡£ÀûÓýøÐй¦Äܸ»¼¯·ÖÎö¡£È»ºó£¬±¾Ñо¿Ó¦CytoscapeÈí¼þ£¨https://cyto scape.Óõ¥±äÁ¿Cox»Ø¹éºÍLasso»Ø¹é[6]·ÖÎöorg/£©½øÐпÉÊÓ»¯£¬MCODE²å¼þ¹¹½¨ÈéÏÙ°©Ô¤ºóÄ£ÐÍ£¬Ô¤²â»¼ÕßԤʶ±ð¹Ø¼üÄ£¿é¡£Ã¿¸öÄ£¿éÖй¦Äܺó¡£ÆäÖУ¬Í¨¹ýµ¥ÒòËØCox»Ø¹é£¬¸»¼¯·ÖÎö¾ùÔÚMetascapeÊý¾Ý¿âÖÐÒÔP<0.001Ϊ±ê×¼£¬È·¶¨49¸ö¿ÉÒÔ½øÐС£Ô¤²âÔ¤ºóµÄmRNA£»Í¨¹ýLasso»Ø1.2.4 ¹¹½¨ÈéÏÙ°©Ô¤ºóÄ£ÐÍ[7] ÀûÓùé½øÒ»²½É¸Ñ¡16¸ömRNA×é³ÉÒ»¸öÈéÏÙ°©Êý¾Ý¼¯£¨TCGA-BRCA£©·çÏÕÔ¤ºóÄ£ÐÍ£¬ÀûÓÃÈéÏÙ°©4ÖÖÑǽøÐе¥±äÁ¿Cox»Ø¹é·ÖÎöºÍLassoÐÍÆÀ¹À¸ÃÔ¤ºóÄ£Ð͵ÄÁÙ´²Ð§Óá£Ñлعé·ÖÎö£¬ÒÔÈ·¶¨Ô¤ºóÏà¹Ø»ù¾¿½á¹ûΪԤ²âºÍÆÀ¹ÀÈéÏÙ°©»¼ÕßµÄÒò£¬¹¹½¨Ô¤ºó»ùÒò¼¯¡£µ¥ÒòËØÁÙ´²Ô¤ºóÌṩеļû½â¡£Cox»Ø¹é·ÖÎöP<0.001±»ÈÏΪ¾ßÓÐÃ÷ÏÔͳ¼ÆѧÒâÒå¡£Ô¤ºó»ùÒò1 ×ÊÁÏÓë·½·¨ÌØÕ÷±íÏÖΪ·çÏÕÆÀ·Ö=£¨ÏµÊý1.1 ×ÊÁÏÀ´Ô´ ±¾Ñо¿´ÓTCGAÈéÏÙmRNA1¡Á±í´ïmRNA1£©+£¨ÏµÊý°©Êý¾Ý¼¯£¨TCGA-BRCA£©ºÍGEOmRNA2¡Á±í´ïmRNA2£©+¡+£¨ÏµÈéÏÙ°©Êý¾Ý¼¯£¨GSE139038£©ÖÐÊýmRNAn¡Á±í´ïmRNAn£©¡£R°ü»ñÈ¡mRNAºÍÁÙ´²Êý¾Ý¡£ÆäÖУ¬¡°Survival¡±ºÍ¡°Survminer¡±±»ÓÃTCGA-BRCAÊý¾Ý¼¯°üº¬1 102¸öÈéÓÚ̽Ë÷·çÏյ÷ֵÄ×î¼Ñ½Ø¶ÏÖµ£¬ÏÙ°©Ñù±¾ºÍ113¸öÕý³£ÈéÏÙÑù±¾£»»æÖÆKaplan-MeierÉú´æÇúÏß¡£R°üGSE139038°üº¬41¸öÈéÏÙ°©Ñù±¾ºÍ¡°Survminer¡±ÖС°surv_cutpoint¡±24¸öÕý³£ÈéÏÙÑù±¾¡£º¯ÊýÓÃÓÚÈ·¶¨½«»¼Õß·ÖΪ¸ß·çÏÕ108¡¤China Digital Medicine. 2021,Vol.16,No.1×éºÍµÍ·çÏÕ×éµÄ×î¼Ñ½Ø¶ÏÖµ¡£R°ü¡°survivalROC¡±ÓÃÓÚÑо¿Ô¤ºó»ùÒò¼¯µÄʱ¼äÒÀÀµÐÔÔ¤ºó¼ÛÖµ¡£Ë«Î²logrank¼ìÑéP<0.05±»ÈÏΪ¶Ô»¼ÕßÉú´æÔ¤ºó¾ßÓÐͳ¼ÆѧÒâÒå¡£2 ½á¹û2.1 ʶ±ðÈéÏÙ°©²îÒì±í´ï»ùÒò ͨ¹ý²îÒì·ÖÎö£¬µÃµ½TCGA-BRCAÖÐ1 255¸ömRNAÏÔÖøÉϵ÷£¬1 015¸ömRNAÏÔÖøϵ÷£¨Í¼1A£©£¬ÒÔ¼°GSE13908ÖÐ438¸ömRNAÏÔÖøÉϵ÷ºÍ2 430¸ömRNAÏÔÖøϵ÷£¨Í¼1B£©£¬Í¨¹ý½»¼¯ÔËË㣬µÃµ½176¸ö¹²Í¬Éϵ÷mRNAºÍ562¸ö¹²Í¬Ïµ÷mRNA£¨Í¼1C£¬Í¼1D£©¡£2.2 ²îÒì±í´ïmRNAµÄ¹¦Äܸ»¼¯·ÖÎö Ñ¡È¡¹²Í¬Éϵ÷ºÍ¹²Í¬Ïµ÷mRNA·Ö±ð×ö¹¦Äܸ»¼¯·ÖÎö£¬ÒÔÏÔʾÔÚÈéÏÙ°©Öйý¶È¼¤»îµÄ¹¦ÄÜͨ·ºÍÊÜÒÖÖƹ¦ÄÜ¡£ÔÚÉúÎï½ø³Ì·½Ã棬¹²Í¬Éϵ÷mRNAÖ÷Òª²ÎÓëÓÐË¿·ÖÁÑ¡¢È¾É«Ìå·ÖÁѵÈÉúÎ﹦ÄÜ£¬¹²Í¬Ïµ÷mRNAÔòÖ÷Òª²ÎÓë°¢Ã×°ÍÑùǨÒÆ¡¢Ñª¹Ü·¢ÓýµÄµ÷½ÚµÈ¡£ÔÚϸ°û³É·Ö·½Ã棬¹²Í¬Éϵ÷mRNAÖ÷Òª¼¯ÖÐÔÚ·Ä´¸ÌåºÍȾɫÌåÇøÓò£¬¹²Í¬Ïµ÷mRNAÔòÖ÷Òª¼¯ÖÐÔÚϸ°ûÍâ»ùÖʽºÔºÍϸ°û¼äÁ´½Ó³É·Ö¡£ÔÚ·Ö×Ó¹¦ÄÜ·½Ã棬¹²Í¬Éϵ÷mRNAÖ÷Òª¹¦ÄÜΪ΢¹Üµ°°×½áºÏºÍATPø»îÐÔ£¬¹²Í¬Ïµ÷»ùÒòÖ÷Òª¹¦ÄÜÔòΪ¼¡¶¯µ°°×½áºÏºÍÁò»¯Îï½áºÏ£¨Í¼2A£¬Í¼2B£©¡£ 2.3 µ°°×Ï໥×÷ÓÃÍøÂç¹¹½¨ ͨ¹ý¹¦Äܸ»¼¯·¢ÏÖ£¬ÈéÏÙ°©Öй²Í¬Éϵ÷mRNAÖ÷Òª²ÎÓëÓÐË¿·ÖÁÑ¡¢·Ä´¸Ì塢΢¹Üµ°°×½áºÏµÈ¡£Îª½øÒ»²½Ì½ÌÖ¹²Í¬Éϵ÷mRNAµÄ»¥×÷Çé¿ö£¬ÀûÓù²Í¬Éϵ÷mRNA¹¹½¨Ò»¸öÕûÌåµ°°×Ï໥×÷Óã¨PPI£©ÍøÂ磨ͼ3A£©£¬Í¨¹ýCytoscapeÈí¼þÖÐMCODE²å¼þʶ±ð6¸öÑǼ¯£¨Í¼3B£©¡£Í¨¹ý¶Ô6¸öÑǼ¯µÄ¹¦ÄÜData Management and UtilizationÊý¾Ý¹ÜÀíÓëÀûÓÃNF674ѵÁ·×éÖУ¬¸ù¾Ý×î¼Ñ½Ø¶ÏÖµ17.414£¨Í¼4A£©½«744¸öÑù±¾·ÖΪ¸ß·çÏÕ×éºÍµÍ·çÏÕ×飬½á¹ûÏÔʾ¸ß·çÏÕ×é×ÜÌåÉú´æÆÚ£¨Overall Survival£¬OS£©Ã÷ÏÔ¸ü²î£¨Í¼5A£©¡£Ò»Äê¡¢ÈýÄêºÍÎåÄêROCÇúÏßÏÂÃæ»ý·Ö±ðΪ0.820¡¢0.792ºÍ0.747£¨Í¼6A£©¡£ÑéÖ¤×éÖУ¬¸ù¾Ý×î¼Ñ½Ø¶ÏÖµ17.457£¨Í¼4B£©½«318¸öÑù±¾·ÖΪ¸ß·çÏÕ×éºÍµÍ·çÏÕ×é¡£½á¹ûÏÔʾ¸ß·çÏÕ×éµÄOSÃ÷ÏÔµÍÓڵͷçÏÕ×飨ͼ5B£©¡£Ò»Äê¡¢ÈýÄêºÍÎåÄêROCÇúÏßÏÂÃæ»ý·Ö±ðΪ0.741¡¢0.747ºÍ0.699£¨Í¼6B£©¡£ 2.5 Ô¤ºóÄ£ÐÍÔÚÈéÏÙ°©ÑÇÐÍÖеÄÑéͼ1 ²îÒì·ÖÎö½á¹ûÖ¤ ÈéÏÙ°©Ö÷Òª·ÖΪ»ùµ×ϸ°ûÐÍ¡¢¸»¼¯·ÖÎö£¬·¢ÏÖµÚ1¸öÑǼ¯°üº¬18AM47E-STBD1+0.544481053*INHer2»ùÒòÑôÐÔÐÍ¡¢¹ÜÇ»AÐͺ͹ܸömRNA£¬Ö÷Òª²ÎÓë½ãÃÃȾɫµ¥ÌåPP5A+0.012442297*MMGT1+0Ç»BÐ͵È4ÖÖÑÇÐÍ¡£ÓÉÓÚ4ÖÖÑÇÐÍÔ¤Äý¾ÛÁ¦µÄ·Ö½â¡¢ÓÐË¿·ÖÁÑÇ°ÖÐÆÚ.240226175*MRO+0.127665554ºó²»Ò»Ö£¬±¾ÎÄ·Ö±ðÔÚ4ÖÖÑÇÐÍÖкÍÓÐË¿·ÖÁѵÄMÆÚ£¨Í¼3C£©£»µÚ2*MURC+0.034864656*PGK1-0-ÑéÖ¤·çÏÕÔ¤ºóÄ£Ð͵ÄЧ¹û¡£½á¹û¸öÑǼ¯°üº¬12¸ömRNA£¬Ö÷Òª¸»¼¯.052263593*RPL29+0.014546939*ÏÔʾ£¬ÔÚ»ùµ×ϸ°ûÐÍ£¨Í¼7A£©¡¢ÔÚͬԴ¶¨ÏòÐÞ¸´ºÍDNAË«Á´¶ÏÁÑSDCBP2-0.217423827*SIPA1L1+0.Her2»ùÒòÑôÐÔÐÍ£¨Í¼7B£©¡¢¹ÜÇ»ÐÞ¸´£»µÚ3ºÍµÚ4¸öÑǼ¯·Ö±ð°üº¬013488637*STXBP5+0.236903211*AÐÍ£¨Í¼7C£©ºÍ¹ÜÇ»BÐÍ£¨Í¼7D£©4¸ömRNA£¬Ö÷Òª¹¦ÄÜ°üÀ¨²ÎÓëÓÐTARS+0.156997454*TMEM233+0.2ÖУ¬¸ß·çÏÕ»¼ÕßÔ¤ºó¾ù±ÈµÍ·çÏÕ»¼Ë¿·ÖÁÑG1-G1/SÆÚºÍECT2-KIF23-02116355*WWOX+0.054385153*ZÕßÔ¤ºó²î£¬½á¹û¾ùÓÐͳ¼ÆѧÒâÒå¡£PACGAP1¸´ºÏÌ壻µÚ5ºÍµÚ6¸öÑǼ¯·Ö±ð°üº¬3¸ömRNA£¬²ÎÓëG2/MÆÚµÄת»¯ºÍ¸ÆÀë×Ó½øÈë°û½¬µÄÕýÏòµ÷½Ú¡£2.4 ÈéÏÙ°©Ô¤ºóÄ£Ð͹¹½¨ ½«TCGAÈéÏÙ°©Ñù±¾°´7:3µÄ±ÈÀýËæ»ú·ÖΪѵÁ·×éºÍÑéÖ¤×飬ÆäÖÐѵÁ·×é°üÀ¨744¸öÑù±¾£¬ÑéÖ¤×é°üÀ¨318¸öÑù±¾¡£ÑµÁ·×éÖУ¬Í¨¹ýÅúÁ¿µ¥ÒòËػعé·ÖÎö£¬É¸Ñ¡³ö49¸ömRNA×÷ΪÈéÏÙ°©Ô¤ºóÒòËØ£¨P<0.001£©¡£Í¨¹ýLasso»Ø¹é·ÖÎö½øÒ»²½½µÎ¬£¬»ñµÃÒ»¸öÓÉ16¸ömRNA×é³ÉµÄ·çÏÕÔ¤ºóÄ£ÐÍ¡£·çÏÕÖµ¼ÆËãÄ£ÐÍ£º0.117692865*ATP5B+0.122637274*DCUN1D4+0.23425161*Fͼ2 ²îÒì±í´ï»ùÒòµÄ¹¦Äܸ»¼¯·ÖÎö¡¶ÖйúÊý×Öҽѧ¡·2021µÚ16¾í µÚ1ÆÚ ¡¤109
»ùÓÚTCGAÊý¾ÝÍÚ¾òµÄÈéÏÙ°©Ô¤ºóÏà¹ØmRNAÑо¿



