24小时热门版块排行榜    

查看: 4367  |  回复: 100
【奖励】 本帖被评价86次,作者1949stone增加金币 67.8

[资源] 生物信息学-BIOINFORMATICS: A CONCEPT-BASED INTRODUCTION

Scientific disciplines evolve and mature into different areas of specialization
to accommodate new knowledge and methods that are being developed by
the research community. The last decade has seen a dramatic change in most
fields and the information technology has revolutionized several fields.
Bioinformatics is the perfect marriage between computer science and
advanced biology. Complex biological processes, macromolecular
components and their functional interplay define the basis of living cells.
Biological experiments that aim to reveal the complexity of cellular systems
and biomolecular functions produce huge volumes of data or information
that needs to be efficiently handled for tangible results. Exponential increase
in genome sequences, protein sequences, protein interactions and biological
networks/pathways information has created a demand for efficient
information handling. This led to the birth of the field of Bioinformatics that
aims to handle biological information using computational methods and
algorithms. Bioinformatics is evolving into a mature field with an everincreasing
participation from the scientific community. The past five years
have seen a rapid increase in the number of scientific journals in this field. It
is impossible to include all the topics of Bioinformatics in a book and still
cater to the needs of newcomers attracted to this field. This is an
introductory book that provides a balance between computational methods
and biological information. Instead of delving in depth, for each topic we
provide a broad but necessary content that will benefit readers with different
levels of expertise.
1 Introduction to Biological Systems........................................................1
Claude-Henry Volmar, Nikunj Patel, Amita N. Quadros,
Daniel Paris, Venkatarajan S. Mathura and Michael Mullan
1. Molecules of Life.................................................................................. 1
2. Nucleic Acids: DNA Versus RNA ....................................................... 2
3. Understanding Proteins: Sequence–Structure–Function....................... 4
4. Biological Systems, Signals, and Pathways.......................................... 5
5. Technological Advances and Their Benefits to Biology ...................... 7
6. The Role of Bioinformatics in Big Picture ........................................... 8
7. Exercises ............................................................................................... 9
References............................................................................................... 10
2 Computer Programming Fundamentals and Concepts ....................13
Deepak N. Kolippakkam, Pankaj Gupta
and Venkatarajan S. Mathura
1. Purpose ............................................................................................... 13
2. Learning Objective ............................................................................. 13
3. Perl Programming............................................................................... 14
3.1 Variables ....................................................................................... 14
3.2 Operators....................................................................................... 15
3.3 Control Structures ......................................................................... 16
3.4 Regular Expressions ..................................................................... 17
3.5 File Handling ................................................................................ 18
3.6 Subroutines and Functions............................................................ 18
xii Contents
4. PHP Programming .............................................................................. 19
4.1 Language Syntax and Data Types................................................. 19
4.2 Creating Web Interfaces ............................................................... 22
5. Basic RDBMS and SQL ..................................................................... 24
5.1 Data Definition Language (DDL)................................................. 24
5.2 Data Manipulation Language (DML) ........................................... 25
5.3 Data Control Language (DCL) ..................................................... 26
6. Web-Pointers ...................................................................................... 26
3 Introduction to Algorithms ..................................................................27
Senthilkumar Radhakrishnan, Deepak Kolippakkam
and Venkatarajan S. Mathura
1. Introduction......................................................................................... 27
1.1 Classification ................................................................................ 27
1.2 Hypothesis Testing ....................................................................... 28
1.3 Decision Tree................................................................................ 28
1.4 Clustering...................................................................................... 29
1.5 Principal Component Analysis ..................................................... 29
1.6 Multidimensional Scaling ............................................................. 29
1.7 Regression Analysis...................................................................... 29
1.8 Linear Discriminant Analysis ....................................................... 30
1.9 Fuzzy Logic .................................................................................. 30
1.10 Pattern Recognition..................................................................... 31
1.11 Bayesian Statistics ...................................................................... 31
1.12 Neural Networks ......................................................................... 32
1.13 Hidden Markov Model................................................................ 32
1.14 Support Vector Machines ........................................................... 33
2. Exercises ............................................................................................. 33
3. Useful Web-Pointers........................................................................... 34
References............................................................................................... 35
4 Biological Sequence Databases ............................................................39
Meena Sakharkar, Pandjassarame Kangueane
and Venkatarajan S. Mathura
1. Purpose ............................................................................................... 39
2. Learning Objective ............................................................................. 39
3. Introduction......................................................................................... 39
3.1 Genomic Sequence Databases – GenBank, EMBL, DDBJ .......... 41
3.2 Protein Sequence Databases ......................................................... 42
3.3 Secondary Databases on Molecular Evolution ............................. 44
References............................................................................................... 46
Contents xiii
5 Biological Sequence Search and Analysis...........................................47
Venkatarajan S. Mathura
1. Purpose ............................................................................................... 47
2. Learning Objectives............................................................................ 47
3. Introduction......................................................................................... 48
3.1 Similarity Matrices and Alignment............................................... 48
3.2 Sequence Search and Pair-Wise Alignment ................................. 50
3.3 Global Alignment Using Needleman-Wunsch Algorithm............ 51
3.4 Sequence Search Tools ................................................................. 53
3.5 Pair-Wise and Multiple-Sequence Alignment Tools .................... 55
3.6 Sequence Motifs ........................................................................... 57
References............................................................................................... 61
6 Protein Structure Prediction................................................................63
Hongyi Zhou, Yaoqi Zhou and Venkatarajan S. Mathura
1. Introduction......................................................................................... 63
2. Secondary Structure Prediction .......................................................... 65
3. Comparative Modeling ....................................................................... 66
3.1 Steps Involved in Comparative Modeling .................................... 67
3.2 Homologous Sequence Search Using Sequence
Comparison Tools......................................................................... 67
3.3 Identifying Remote Templates Using Fold-Recognition
Methods ........................................................................................ 68
3.4 Selection of the Alignment ........................................................... 69
3.5 Construction of 3D Models Using Modeling Programs ............... 69
3.6 Protein Modeling Package – MPACK.......................................... 70
3.7 SP3 – A Web-Based Structure-Prediction Tool Using
Known Protein Structures as Templates ....................................... 70
3.8 Modeling Servers.......................................................................... 73
3.9 Critical Assessment of Structure Prediction ................................. 74
3.10 Objective Testing of Modeling Tools in CASP.......................... 74
References............................................................................................... 75
7 Protein-Protein Interaction and Macromolecular Visualization...... 79
Arun Ramani, Venkatarajan S. Mathura, Cui Zhanhua
and Pandjassarame Kangueane
1. Introduction......................................................................................... 79
2. Experimental Methods........................................................................ 80
2.1 Yeast Two-Hybrid ........................................................................ 80
2.2 Affinity Tagging ........................................................................... 81
2.3 Computational Methods................................................................ 82
2.4 Co-evolution ................................................................................. 83
xiv Contents
2.5 Structure Based Methods .............................................................. 83
3. Protein Structure Visualization........................................................... 91
4. Databases ............................................................................................ 91
References............................................................................................... 93
8 Genes, Genomics, Microarray Methods and Analysis ...................... 97
Ghania Ait-Ghezala and Venkatarajan S. Mathura
1. Introduction......................................................................................... 97
2. Gene Identification and Characterization ........................................... 98
2.1 Identifying Human Genes and Cloning ........................................ 98
3. Microarray Experiments ................................................................... 102
3.1 Microarray Databases ................................................................. 104
3.2 Gene Annotations, Ontology, and Pathway Databases............... 104
References............................................................................................. 105
9 Introduction to Proteomics ................................................................107
Fai Poon and Venkatarajan S. Mathura
1. Introduction....................................................................................... 107
2. Sample Preparation ........................................................................... 108
3. Two-Dimensional (2D) Gel Electrophoresis .................................... 108
3.1 Image Analysis and Statistical Analysis ..................................... 109
3.2 In-Gel Digestion and Mass Spectrometry................................... 109
4. Mass Spectrometry ........................................................................... 109
4.1 Mass Spectrometry in Proteomics .............................................. 110
5. Bioinformatics Applications for Identification................................. 111
6. Conclusion ........................................................................................ 113
References............................................................................................. 113
10 Biomedical Literature Mining ...........................................................115
Chaolin Zhang and Michael Q. Zhang
1. Introduction....................................................................................... 115
2. Literature Sources for Mining........................................................... 117
3. Recognition of Biological Terms...................................................... 118
3.1 Gene/Protein Name Recognition ................................................ 119
3.2 Removing Gene/Protein Name Ambiguities .............................. 120
3.3 Collecting Other Keywords ........................................................ 120
4. Mining Biological Relationships ...................................................... 121
4.1 Detecting Gene Interactions by Co-occurrence .......................... 121
4.2 Inferring Implicit Relationships.................................................. 122
4.3 Identifying Sub-networks of Communities................................. 123
4.4 Evaluating Functional Coherence of Gene Group ...................... 124
References............................................................................................. 125
5. Acknowledgments ............................................................................ 124
Contents xv
11 Computational Immunology: HLA-Peptide Binding Prediction....129
Pandjassarame Kangueane, Bing Zhao and Meena K. Sakharkar
1. Background....................................................................................... 129
2. HLA Molecules ................................................................................ 131
3. HLA Binding Peptide Based Methods.............................................. 132
3.1 Sequence Based Prediction Models ............................................ 133
3.2 Molecular Structure Based Predictions....................................... 143
4. Conclusion ........................................................................................ 150
References............................................................................................. 151
12 Bioinformatics Application: Eukaryotic Gene
Count and Evolution ...........................................................................155
Meena K. Sakharkar and Pandjassarame Kangueane
1. Introduction....................................................................................... 155
2. Methodology..................................................................................... 156
2.1 Identification of SEG .................................................................. 156
2.2 Identification of MEG................................................................. 156
2.3 Pseudogenes................................................................................ 157
2.4 Caveats........................................................................................ 157
2.5 Total Genes ................................................................................. 158
3. Results and Discussion ..................................................................... 158
3.1 Utility of SEG and MEG Sequences to the Study of Evolution.... 158
3.2 Selection of SEG and MEG in Different Eukaryotic Genomes.... 158
3.3 Mechanism of SEG Origin ......................................................... 160
4. Conclusion ........................................................................................ 161
References............................................................................................. 162
13 Bioinformatics Application: Predicting Protein Subcellular
Localization by Applying Machine Learning ...................................163
Pingzhao Hu, Clement Chung, Hui Jiang and Andrew Emili
1. Introduction....................................................................................... 163
2. Methods ............................................................................................ 165
2.1 Data Sets and Preprocessing ....................................................... 165
2.2 Learning Algorithm .................................................................... 166
2.3 Evaluating Performance of the Learning Algorithm................... 167
2.4 Strategy for Multi-class/Multi-label Classification..................... 167
2.5 Optimal Sampling Methods for Imbalanced Data Sets............... 168
2.6 Algorithm of Asymmetric Bagging Strategy.............................. 169
3. Results............................................................................................... 170
4. Discussion......................................................................................... 172
References............................................................................................. 172
xvi Contents
14 Bioinformatics Analysis: Gene Fusion..............................................175
Meena Kishore Sakharkar, Yiting Yu
1. Introduction....................................................................................... 175
2. Identification of Fusion Proteins....................................................... 176
2.1 Human Fusion Proteins Mimicking Bacterial Operons .............. 177
2.2 Human Fusion Proteins Simulating Bacterial Subunit
Interfaces..................................................................................... 177
2.3 Fusion Proteins Exhibiting Multiple Functions .......................... 177
2.4 Fusion Proteins Showing Alternative Splicing ........................... 178
3. Remarks on Fusion Proteins ............................................................. 178
References............................................................................................. 180
Index ..........................................................................................................183
回复此楼
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

★★★★★ 五星级,优秀推荐

大哥你好多好东西啊
当年我生物信息学才60分,老师网开一面
有时间补补课
2楼2011-08-12 21:57:16
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

florayo

银虫 (正式写手)


★★★★★ 五星级,优秀推荐

学习学习~~
5楼2011-08-13 12:07:57
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
引用回帖:
2楼: Originally posted by 西瓜 at 2011-08-12 21:57:16:
大哥你好多好东西啊
当年我生物信息学才60分,老师网开一面
有时间补补课

六十分 貌似都是七十分及格吧
6楼2011-08-13 17:44:45
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
引用回帖:
5楼: Originally posted by florayo at 2011-08-13 12:07:57:
学习学习~~

学学也好 吼吼
7楼2011-08-13 17:45:00
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
引用回帖:
6楼: Originally posted by 1949stone at 2011-08-13 17:44:45:
六十分 貌似都是七十分及格吧

60分及格,本科的
8楼2011-08-13 17:46:19
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
引用回帖:
8楼: Originally posted by 西瓜 at 2011-08-13 17:46:19:
60分及格,本科的

哦 吼吼 那是万幸啊
9楼2011-08-13 17:46:44
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

xiaohaiyu

金虫 (正式写手)


★★★★★ 五星级,优秀推荐

非常感谢分享
11楼2011-08-15 12:00:02
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

springimu

木虫 (著名写手)


★★★★★ 五星级,优秀推荐

很不错,同行
14楼2011-08-16 12:26:57
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

水天sky

金虫 (小有名气)


★★★★★ 五星级,优秀推荐

顶一下,感谢分享!
非常感谢
17楼2011-10-05 16:49:38
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

teagirl3380

银虫 (初入文坛)


★★★★★ 五星级,优秀推荐

好东西啊,谢谢哈!
18楼2011-10-06 13:44:07
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

jiangleisc

金虫 (小有名气)


★★★★★ 五星级,优秀推荐

谢谢楼主
27楼2011-10-28 19:35:40
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

sunjiutong

金虫 (著名写手)


★★★★★ 五星级,优秀推荐

顶一下,感谢分享!特别感谢楼主,我正想学习这方面的知识呢
33楼2011-11-17 09:43:43
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

malstose

金虫 (正式写手)


★★★ 三星级,支持鼓励

多谢啦!
39楼2011-11-20 20:27:17
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
简单回复
jinhx873楼
2011-08-13 07:01   回复  
五星好评  顶一下,感谢分享!
2011-08-13 10:21   回复  
五星好评  顶一下,感谢分享!
july97510楼
2011-08-14 22:08   回复  
五星好评  顶一下,感谢分享!
okjstor12楼
2011-08-16 11:20   回复  
五星好评  顶一下,感谢分享!
Jarvis201013楼
2011-08-16 11:42   回复  
五星好评  顶一下,感谢分享!
hlhappycat15楼
2011-09-17 14:26   回复  
五星好评  
2011-10-05 13:02   回复  
五星好评  顶一下,感谢分享!
白云飞19楼
2011-10-06 21:59   回复  
五星好评  顶一下,感谢分享!
2011-10-11 12:19   回复  
五星好评  
zouqingyu21楼
2011-10-16 19:25   回复  
三星好评  顶一下,感谢分享!
hy83072722楼
2011-10-18 09:44   回复  
五星好评  顶一下,感谢分享!
changbo52923楼
2011-10-25 00:45   回复  
三星好评  顶一下,感谢分享!
nuoyaeva24楼
2011-10-25 17:19   回复  
五星好评  顶一下,感谢分享!
wnw677761425楼
2011-10-26 21:11   回复  
三星好评  顶一下,感谢分享!
wang00wmd26楼
2011-10-27 10:11   回复  
五星好评  顶一下,感谢分享!
jiangleisc28楼
2011-10-28 19:36   回复  
顶一下,感谢分享!
xiemin21729楼
2011-10-29 10:00   回复  
五星好评  顶一下,感谢分享!
korc30楼
2011-11-04 09:21   回复  
五星好评  顶一下,感谢分享!
2011-11-08 20:04   回复  
五星好评  顶一下,感谢分享!
shaoriyan32楼
2011-11-11 23:27   回复  
五星好评  顶一下,感谢分享!
urowufei34楼
2011-11-17 13:35   回复  
五星好评  顶一下,感谢分享!
chzwolf35楼
2011-11-17 21:30   回复  
五星好评  谢谢分享啊
sunjiutong36楼
2011-11-19 13:08   回复  
顶一下,感谢分享!
hbcobo37楼
2011-11-19 21:03   回复  
五星好评  顶一下,感谢分享!
nasi38楼
2011-11-20 01:02   回复  
五星好评  顶一下,感谢分享!
2011-11-21 12:06   回复  
五星好评  顶一下,感谢分享!
dhy48141楼
2011-11-21 21:35   回复  
五星好评  顶一下,感谢分享!
2011-11-23 15:31   回复  
五星好评  顶一下,感谢分享!
yangming9843楼
2011-11-28 20:37   回复  
五星好评  顶一下,感谢分享!
bst544楼
2012-02-07 10:39   回复  
五星好评  顶一下,感谢分享!
51929337845楼
2012-03-07 17:24   回复  
五星好评  顶一下,感谢分享!
2012-03-08 12:35   回复  
五星好评  顶一下,感谢分享!
2012-03-08 23:13   回复  
五星好评  顶一下,感谢分享!
feiyun100148楼
2012-03-09 11:57   回复  
五星好评  顶一下,感谢分享!
feiyun100149楼
2012-03-09 11:57   回复  
顶一下,感谢分享!
割麦者50楼
2012-05-22 17:15   回复  
五星好评  顶一下,感谢分享!
相关版块跳转 我要订阅楼主 1949stone 的主题更新
☆ 无星级 ★ 一星级 ★★★ 三星级 ★★★★★ 五星级
最具人气热帖推荐 [查看全部] 作者 回/看 最后发表
[基金申请] 刚刚收到科研之友邮件 +18 olivermiaoer 2024-06-19 25/1250 2024-06-20 12:34 by 凌绝顶
[基金申请] 面青地会评时间??? +7 Axvdvbfs 2024-06-19 8/400 2024-06-20 11:16 by 路遥还有谁
[基金申请] 工材口青年基金上会可能性 +8 今晚推荐22 2024-06-19 10/500 2024-06-20 11:00 by 手心里的幸福
[考博] 有机化学迷茫学生 +4 佛系摸鱼5 2024-06-18 6/300 2024-06-20 10:05 by 295143924
[基金申请] 太卷了 +14 laoyuefubio 2024-06-17 27/1350 2024-06-20 09:52 by htjwqy
[催化] 镍负载氧化铝的保存问题 8+3 lwn0130 2024-06-15 6/300 2024-06-20 09:00 by lwn0130
[找工作] 高校两个offer选择 +14 cowox2021 2024-06-18 15/750 2024-06-20 08:34 by 梦渺岚烟
[基金申请] 江南大学到瑞士招聘,称取消非升即走,改预聘+长聘 +21 babu2015 2024-06-18 22/1100 2024-06-19 23:03 by feng6531
[访问学者] 国家公派访问学者申请结果出了吗? +4 65syn 2024-06-13 4/200 2024-06-19 16:40 by 海洋之心168
[论文投稿] 求机械类四区sci推荐 5+4 迷茫小旷 2024-06-14 5/250 2024-06-19 14:08 by tangjie12345
[论文投稿] ACS AMI 返回审稿意见,一个大修,两个据稿,编辑给的修改重投 +5 智商已更新 2024-06-19 5/250 2024-06-19 12:35 by nono2009
[硕博家园] 关于硕博连读的一些疑问? +8 Lwenter 2024-06-14 10/500 2024-06-19 10:00 by qingdao001
[论文投稿] 审稿人含糊拒稿,还需要回复吗?如何回复? 20+4 BruceChum 2024-06-15 22/1100 2024-06-19 08:00 by kanyechris
[公派出国] CSC德国博后每个月资助多少呀?够用吗 +4 326lhpqk 2024-06-16 7/350 2024-06-19 02:03 by PLHOU
[基金申请] F口401需要啥文章水平 +3 lhjr123 2024-06-16 7/350 2024-06-18 16:05 by hon920603
[硕博家园] 博士毕业高校和就业的相关问题 +7 SCITOPPP 2024-06-14 11/550 2024-06-18 07:51 by yinxing1995
[论文投稿] 审稿问题:为什么荧光激发波长和紫外吸收波长差的大? 10+5 sdawege 2024-06-14 10/500 2024-06-17 18:54 by HH-探针
[基金申请] 关于博后基金的bug问题 +6 lxr1991 2024-06-14 9/450 2024-06-15 21:17 by since—2010
[基金申请] 博士后基金需要结题吗? +8 zhouchuck 2024-06-13 8/400 2024-06-14 17:27 by liuyupu132
[基金申请] 国自然基金公布的时候基金号有吗 +8 潇洒怡惜 2024-06-13 11/550 2024-06-14 11:24 by JRfei
信息提示
请填处理意见