| 网站首页 | 业界新闻 | 小组 | 威客 | 人才 | 下载频道 | 博客 | 代码贴 | 在线编程 | 编程论坛
欢迎加入我们,一同切磋技术
用户名:   
 
密 码:  
共有 1245 人关注过本帖
标题:Python按关键字提取txt文本并保存到Excel
只看楼主 加入收藏
初学者_123
Rank: 2
等 级:论坛游民
帖 子:25
专家分:14
注 册:2021-11-6
结帖率:100%
收藏
已结贴  问题点数:20 回复次数:2 
Python按关键字提取txt文本并保存到Excel
从web of science导出了许多(200多)篇文章的相关信息,如标题、摘要、关键字、作者、参考文献、起始页码,发表的期刊等等信息,不同文章之间通过空行隔开。
下面我放了两篇文章的信息,对每一篇文章来说,我只想要提取其中的部分信息,将它写成一个excel表格,关键在于如何对数据进行分割?
PT J
AU Newman, JRS
   Ghaemmaghami, S
   Ihmels, J
   Breslow, DK
   Noble, M
   DeRisi, JL
   Weissman, JS
AF Newman, John R. S.
   Ghaemmaghami, Sina
   Ihmels, Jan
   Breslow, David K.
   Noble, Matthew
   DeRisi, Joseph L.
   Weissman, Jonathan S.
TI Single-cell proteomic analysis of S-cerevisiae reveals the architecture
   of biological noise
SO NATURE
LA English
DT Article
ID STOCHASTIC GENE-EXPRESSION; MESSENGER-RNA; FLUORESCENT PROTEIN;
   EUKARYOTIC GENOME; GLOBAL ANALYSIS; YEAST PROTEOME; SYSTEM;
   LOCALIZATION; DYNAMICS; SWITCH
AB A major goal of biology is to provide a quantitative description of cellular behaviour. This task, however, has been hampered by the difficulty in measuring protein abundances and their variation. Here we present a strategy that pairs high-throughput flow cytometry and a library of GFP-tagged yeast strains to monitor rapidly and precisely protein levels at single-cell resolution. Bulk protein abundance measurements of >2,500 proteins in rich and minimal media provide a detailed view of the cellular response to these conditions, and capture many changes not observed by DNA microarray analyses. Our single-cell data argue that noise in protein expression is dominated by the stochastic production/destruction of messenger RNAs. Beyond this global trend, there are dramatic protein-specific differences in noise that are strongly correlated with a protein's mode of transcription and its function. For example, proteins that respond to environmental changes are noisy whereas those involved in protein synthesis are quiet. Thus, these studies reveal a remarkable structure to biological noise and suggest that protein noise levels have been selected to reflect the costs and potential benefits of this variation.
C1 Univ Calif San Francisco, Howard Hughes Med Inst, San Francisco, CA 94107 USA.
   Univ Calif San Francisco, Dept Cellular & Mol Pharmacol, San Francisco, CA 94107 USA.
   Univ Calif San Francisco, Dept Biochem & Biophys, San Francisco, CA 94107 USA.
   Calif Inst Quantitat Biomed Res, San Francisco, CA 94107 USA.
RP Newman, JRS (通讯作者),Univ Calif San Francisco, Howard Hughes Med Inst, 1700 4th St, San Francisco, CA 94107 USA.
EM weissman@cmp.ucsf.edu
OI Ghaemmaghami, Sina/0000-0002-8696-2950; Breslow,
   David/0000-0003-0245-3348
CR Andersen JS, 2005, NATURE, V433, P77, DOI 10.1038/nature03207
   Ashburner M, 2000, NAT GENET, V25, P25, DOI 10.1038/75556
   Balaban NQ, 2004, SCIENCE, V305, P1622, DOI 10.1126/science.1099390
   Barkai N, 2000, NATURE, V403, P267, DOI 10.1038/35002258
   Becskei A, 2005, NAT GENET, V37, P937, DOI 10.1038/ng1616
   Biggar SR, 2001, EMBO J, V20, P3167, DOI 10.1093/emboj/20.12.3167
   Blake WJ, 2003, NATURE, V422, P633, DOI 10.1038/nature01546
   Colman-Lerner A, 2005, NATURE, V437, P699, DOI 10.1038/nature03998
   Cormack BP, 1996, GENE, V173, P33, DOI 10.1016/0378-1119(95)00685-0
   Edwards BS, 1999, CYTOMETRY, V37, P156, DOI 10.1002/(SICI)1097-0320(19991001)37:2<156::AID-CYTO9>3.0.CO;2-T
   EITZMAN PD, 1989, CYTOMETRY, V10, P475, DOI 10.1002/cyto.990100417
   Elowitz MB, 2002, SCIENCE, V297, P1183, DOI 10.1126/science.1070919
   Fagarasanu M, 2005, J CELL BIOL, V169, P765, DOI 10.1083/jcb.200503083
   Felice MR, 2005, J BIOL CHEM, V280, P22181, DOI 10.1074/jbc.M414663200
   Ferrell JE, 1998, SCIENCE, V280, P895, DOI 10.1126/science.280.5365.895
   Florens L, 2002, NATURE, V419, P520, DOI 10.1038/nature01107
   Fraser HB, 2004, PLOS BIOL, V2, P834, DOI 10.1371/journal.pbio.0020137
   Futcher B, 1999, MOL CELL BIOL, V19, P7357
   Ghaemmaghami S, 2003, NATURE, V425, P737, DOI 10.1038/nature02046
   Gygi SP, 1999, MOL CELL BIOL, V19, P1720, DOI 10.1128/mcb.19.3.1720
   Harbison CT, 2004, NATURE, V431, P99, DOI 10.1038/nature02800
   Hershko A, 1998, ANNU REV BIOCHEM, V67, P425, DOI 10.1146/annurev.biochem.67.1.425
   Holstege FCP, 1998, CELL, V95, P717, DOI 10.1016/S0092-8674(00)81641-4
   Huh WK, 2003, NATURE, V425, P686, DOI 10.1038/nature02026
   Huisinga KL, 2004, MOL CELL, V13, P573, DOI 10.1016/S1097-2765(04)00087-5
   Kellis M, 2003, NATURE, V423, P241, DOI 10.1038/nature01644
   Kumar A, 2002, GENE DEV, V16, P707, DOI 10.1101/gad.970902
   Lahav G, 2004, NAT GENET, V36, P147, DOI 10.1038/ng1293
   NOVICK A, 1957, P NATL ACAD SCI USA, V43, P553, DOI 10.1073/pnas.43.7.553
   Ozbudak EM, 2002, NAT GENET, V31, P69, DOI 10.1038/ng869
   Paulsson J, 2005, NAT GENET, V37, P925, DOI 10.1038/ng0905-925
   Paulsson J, 2004, NATURE, V427, P415, DOI 10.1038/nature02257
   Raser JM, 2005, SCIENCE, V309, P2010, DOI 10.1126/science.1105891
   Raser JM, 2004, SCIENCE, V304, P1811, DOI 10.1126/science.1098641
   Samoilov M, 2005, P NATL ACAD SCI USA, V102, P2310, DOI 10.1073/pnas.0406841102
   Schrodinger E., 1944, WHAT IS LIFE PHYS AS
   Shaner NC, 2004, NAT BIOTECHNOL, V22, P1567, DOI 10.1038/nbt1037
   Wang YL, 2002, P NATL ACAD SCI USA, V99, P5860, DOI 10.1073/pnas.092538799
   WARNER JR, 1989, MICROBIOL REV, V53, P256, DOI 10.1128/MMBR.53.2.256-271.1989
   Washburn MP, 2003, P NATL ACAD SCI USA, V100, P3107, DOI 10.1073/pnas.0634629100
   Wei J, 2005, J PROTEOME RES, V4, P801, DOI 10.1021/pr0497632
   Wodicka L, 1997, NAT BIOTECHNOL, V15, P1359, DOI 10.1038/nbt1297-1359
   Wu JQ, 2005, SCIENCE, V310, P310, DOI 10.1126/science.1113230
   Yu LN, 1999, MOL CELL BIOL, V19, P5279
   Zaslaver A, 2004, NAT GENET, V36, P486, DOI 10.1038/ng1348
NR 45
TC 1093
Z9 1103
U1 3
U2 220
PU NATURE PUBLISHING GROUP
PI LONDON
PA MACMILLAN BUILDING, 4 CRINAN ST, LONDON N1 9XW, ENGLAND
SN 0028-0836
EI 1476-4687
J9 NATURE
JI Nature
PD JUN 15
PY 2006
VL 441
IS 7095
BP 840
EP 846
DI 10.1038/nature04785
PG 7
WC Multidisciplinary Sciences
WE Science Citation Index Expanded (SCI-EXPANDED)
SC Science & Technology - Other Topics
GA 052SL
UT WOS:000238254100035
PM 16699522
DA 2022-05-02
ER

PT J
AU Thattai, M
   van Oudenaarden, A
AF Thattai, M
   van Oudenaarden, A
TI Intrinsic noise in gene regulatory networks
SO PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF
   AMERICA
LA English
DT Article
ID ESCHERICHIA-COLI; TRANSCRIPTION INITIATION; EXPRESSION; FLUCTUATIONS;
   TRANSLATION; PROMOTER; MODEL; REPRESSOR; LAMBDA
AB Cells are intrinsically noisy biochemical reactors: low reactant numbers can lead to significant statistical fluctuations in molecule numbers and reaction rates. Here we use an analytic model to investigate the emergent noise properties of genetic systems. We find for a single gene that noise is essentially determined at the translational level, and that the mean and variance of protein concentration can be independently controlled. The noise strength immediately following single gene induction is almost twice the final steady-state value, We find that fluctuations in the concentrations of a regulatory protein can propagate through a genetic cascade; translational noise control could explain the inefficient translation rates observed for genes encoding such regulatory proteins, For an autoregulatory protein, we demonstrate that negative feedback efficiently decreases system noise. The model can be used to predict the noise characteristics of networks of arbitrary connectivity. The general procedure is further illustrated for an autocatalytic protein and a bistable genetic switch, The analysis of intrinsic noise reveals biological roles of gene network structures and can lead to a deeper understanding of their evolutionary origin.
C1 MIT, Dept Phys, Cambridge, MA 02139 USA.
RP van Oudenaarden, A (通讯作者),MIT, Dept Phys, Room 13-2010,77 Massachusetts Ave, Cambridge, MA 02139 USA.
EM avano@mit.edu
RI van Oudenaarden, Alexander/AAA-1705-2019
CR Arkin A, 1998, GENETICS, V149, P1633
   Becskei A, 2000, NATURE, V405, P590, DOI 10.1038/35014651
   Berg OG, 2000, BIOPHYS J, V79, P2944, DOI 10.1016/S0006-3495(00)76531-3
   BERG OG, 1978, J THEOR BIOL, V71, P587, DOI 10.1016/0022-5193(78)90326-0
   Bhalla US, 1999, SCIENCE, V283, P381, DOI 10.1126/science.283.5400.381
   Carrier TA, 1999, J THEOR BIOL, V201, P25, DOI 10.1006/jtbi.1999.1010
   CHAPON C, 1982, EMBO J, V1, P369, DOI 10.1002/j.1460-2075.1982.tb01176.x
   Cook DL, 1998, P NATL ACAD SCI USA, V95, P15641, DOI 10.1073/pnas.95.26.15641
   DeHaseth PL, 1998, J BACTERIOL, V180, P3019, DOI 10.1128/JB.180.12.3019-3025.1998
   Elowitz MB, 2000, NATURE, V403, P335, DOI 10.1038/35002125
   Gardner TS, 2000, NATURE, V403, P339, DOI 10.1038/35002131
   GILLESPIE DT, 1977, J PHYS CHEM-US, V81, P2340, DOI 10.1021/j100540a008
   Hasty J, 2000, P NATL ACAD SCI USA, V97, P2075, DOI 10.1073/pnas.040411297
   KENNELL D, 1977, J MOL BIOL, V114, P1, DOI 10.1016/0022-2836(77)90279-0
   KO MSH, 1991, J THEOR BIOL, V153, P181, DOI 10.1016/S0022-5193(05)80421-7
   McAdams HH, 1999, TRENDS GENET, V15, P65, DOI 10.1016/S0168-9525(98)01659-X
   McAdams HH, 1997, P NATL ACAD SCI USA, V94, P814, DOI 10.1073/pnas.94.3.814
   MCCLURE WR, 1985, ANNU REV BIOCHEM, V54, P171, DOI 10.1146/annurev.biochem.54.1.171
   Paulsson J, 2000, P NATL ACAD SCI USA, V97, P7148, DOI 10.1073/pnas.110057697
   Paulsson J, 2000, PHYS REV LETT, V84, P5447, DOI 10.1103/PhysRevLett.84.5447
   PECCOUD J, 1995, THEOR POPUL BIOL, V48, P222, DOI 10.1006/tpbi.1995.1027
   SCHLAX PJ, 1995, J MOL BIOL, V245, P331, DOI 10.1006/jmbi.1994.0028
   SCHMITT B, 1995, BIOCHEM J, V306, P123, DOI 10.1042/bj3060123
   SHEA MA, 1985, J MOL BIOL, V181, P211, DOI 10.1016/0022-2836(85)90086-5
   Siegele DA, 1997, P NATL ACAD SCI USA, V94, P8168, DOI 10.1073/pnas.94.15.8168
   van Kampen NG, 1992, STOCHASTIC PROCESSES
   VANPUTTE P, 1992, TRENDS GENET, V8, P457
   von Dassow G, 2000, NATURE, V406, P188, DOI 10.1038/35018085
   YARCHUK O, 1992, J MOL BIOL, V226, P581, DOI 10.1016/0022-2836(92)90617-S
NR 29
TC 1022
Z9 1033
U1 4
U2 72
PU NATL ACAD SCIENCES
PI WASHINGTON
PA 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
SN 0027-8424
J9 P NATL ACAD SCI USA
JI Proc. Natl. Acad. Sci. U. S. A.
PD JUL 17
PY 2001
VL 98
IS 15
BP 8614
EP 8619
DI 10.1073/pnas.151588598
PG 6
WC Multidisciplinary Sciences
WE Science Citation Index Expanded (SCI-EXPANDED)
SC Science & Technology - Other Topics
GA 454HK
UT WOS:000169967000074
PM 11438714
OA Bronze, Green Published
DA 2022-05-02
ER
搜索更多相关主题的帖子: For 信息 and CELL the 
2022-05-03 19:46
古123
Rank: 9Rank: 9Rank: 9
等 级:贵宾
威 望:14
帖 子:219
专家分:1098
注 册:2017-2-5
收藏
得分:20 
循环读取每行
判断字符串中是否包含其他字符串用:
程序代码:
if 关键字 in 字符串:
    print('该行中有关键字')
else
    print('该行中无该关键字')


然后split分割,变量存储就好
2022-05-04 09:17
初学者_123
Rank: 2
等 级:论坛游民
帖 子:25
专家分:14
注 册:2021-11-6
收藏
得分:0 
谢谢大佬
2022-05-04 21:13
快速回复:Python按关键字提取txt文本并保存到Excel
数据加载中...
 
   



关于我们 | 广告合作 | 编程中国 | 清除Cookies | TOP | 手机版

编程中国 版权所有,并保留所有权利。
Powered by Discuz, Processed in 0.041008 second(s), 8 queries.
Copyright©2004-2024, BCCN.NET, All Rights Reserved