Statistical Methods in Bioinformatics: An Introduction (Statistics for Biology and Health)
Average customer rating: 5 out of 5 stars
  • Most Elegant Account of Bioinformatics
Statistical Methods in Bioinformatics: An Introduction (Statistics for Biology and Health)
Warren J. Ewens , and Gregory Grant
Manufacturer: Springer
ProductGroup: Book
Binding: Hardcover

GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
GeneticsGenetics | Evolution | Science | Subjects | Books
GeneralGeneral | Medicine | Subjects | Books
BiostatisticsBiostatistics | Research | Medicine | Subjects | Books
BiostatisticsBiostatistics | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
Look Inside Science BooksLook Inside Science Books | Trip | Specialty Stores | Books
All Amazon UpgradeAll Amazon Upgrade | Amazon Upgrade | Stores | Books
Computers & InternetComputers & Internet | Amazon Upgrade | Stores | Books
MedicineMedicine | Amazon Upgrade | Stores | Books
Professional & TechnicalProfessional & Technical | Amazon Upgrade | Stores | Books
ScienceScience | Amazon Upgrade | Stores | Books
All TitlesAll Titles | Qualifying Textbooks - Fall 2007 | Stores | Books
Computers & InternetComputers & Internet | Qualifying Textbooks - Fall 2007 | Stores | Books
MedicineMedicine | Qualifying Textbooks - Fall 2007 | Stores | Books
ProfessionalProfessional | Qualifying Textbooks - Fall 2007 | Stores | Books
ScienceScience | Qualifying Textbooks - Fall 2007 | Stores | Books
Similar Items:
  1. An Introduction to Bioinformatics Algorithms (Computational Molecular Biology) An Introduction to Bioinformatics Algorithms (Computational Molecular Biology)
  2. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
  3. Bioinformatics and Computational Biology Solutions Using R and Bioconductor (Statistics for Biology and Health) Bioinformatics and Computational Biology Solutions Using R and Bioconductor (Statistics for Biology and Health)
  4. Computational Genome Analysis: An Introduction (Statistics for Biology & Health) Computational Genome Analysis: An Introduction (Statistics for Biology & Health)
  5. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology

Accessories:
  1. Evolutionary Bioinformatics Evolutionary Bioinformatics
  2. Introductory Statistics with R Introductory Statistics with R
  3. Fundamentals of Data Mining in Genomics and Proteomics Fundamentals of Data Mining in Genomics and Proteomics

ASIN: 0387400826

Book Description

Advances in computers and biotechnology have had a profound impact on biomedical research, and as a result complex data sets can now be generated to address extremely complex biological questions. Correspondingly, advances in the statistical methods necessary to analyze such data are following closely behind the advances in data generation methods. The statistical methods required by bioinformatics present many new and difficult problems for the research community.

This book provides an introduction to some of these new methods. The main biological topics treated include sequence analysis, BLAST, microarray analysis, gene finding, and the analysis of evolutionary processes. The main statistical techniques covered include hypothesis testing and estimation, Poisson processes, Markov models and Hidden Markov models, and multiple testing methods.

The second edition features new chapters on microarray analysis and on statistical inference, including a discussion of ANOVA, and discussions of the statistical theory of motifs and methods based on the hypergeometric distribution. Much material has been clarified and reorganized.

The book is written so as to appeal to biologists and computer scientists who wish to know more about the statistical methods of the field, as well as to trained statisticians who wish to become involved with bioinformatics. The earlier chapters introduce the concepts of probability and statistics at an elementary level, but with an emphasis on material relevant to later chapters and often not covered in standard introductory texts. Later chapters should be immediately accessible to the trained statistician. Sufficient mathematical background consists of introductory courses in calculus and linear algebra. The basic biological concepts that are used are explained, or can be understood from the context, and standard mathematical concepts are summarized in an Appendix. Problems are provided at the end of each chapter allowing the reader to develop aspects of the theory outlined in the main text.

Warren J. Ewens holds the Christopher H. Brown Distinguished Professorship at the University of Pennsylvania. He is the author of two books, Population Genetics and Mathematical Population Genetics. He is a senior editor of Annals of Human Genetics and has served on the editorial boards of Theoretical Population Biology, GENETICS, Proceedings of the Royal Society B and SIAM Journal in Mathematical Biology. He is a fellow of the Royal Society and the Australian Academy of Science.

Gregory R. Grant is a senior bioinformatics researcher in the University of Pennsylvania Computational Biology and Informatics Laboratory. He obtained his Ph.D. in number theory from the University of Maryland in 1995 and his Masters in Computer Science from the University of Pennsylvania in 1999.

Comments on the First Edition. "This book would be an ideal text for a postgraduate course…[and] is equally well suited to individual study…. I would recommend the book highly" (Biometrics). "Ewens and Grant have given us a very welcome introduction to what is behind those pretty [graphical user] interfaces" (Naturwissenschaften.). "The authors do an excellent job of presenting the essence of the material without getting bogged down in mathematical details" (Journal. American Staistical. Association). "The authors have restructured classical material to a great extent and the new organization of the different topics is one of the outstanding services of the book" (Metrika).

Customer Reviews:

5 out of 5 stars Most Elegant Account of Bioinformatics.......2004-11-27

I was impressed with the 1st edition of this book for its most comprehensive and elegant of statistical techniques in bioinformatics. The book is slightly below the level of the now classic M S Waterman (1995)book:Introduction to Computational Biology: Maps, Sequences and Genomes. But this book is more update in some areas and has much more background materials on probability and statistics, which should provide a solid basis for understanding bioinformatics. Its pedagorical sense is unparalleled. It would make a very good choice for a stat/math oriented introduction to bioinformatics (as opposed to algorithimc/database oriented approach in cs).
Microarray Bioinformatics
Average customer rating: 4 out of 5 stars
  • Great Introduction to Microarray Analysis
  • Neat little book on microarrays
  • If you are new to microarray, get this book.
  • an intro. for biologists
  • A Good Book for Microarray Bioinformatics
Microarray Bioinformatics
Dov Stekel
Manufacturer: Cambridge University Press
ProductGroup: Book
Binding: Paperback

BiochemistryBiochemistry | Biological Sciences | Science | Subjects | Books
GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
Molecular BiologyMolecular Biology | Biology | Biological Sciences | Science | Subjects | Books
MicrobiologyMicrobiology | Biology | Biological Sciences | Science | Subjects | Books
BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
GeneticsGenetics | Evolution | Science | Subjects | Books
GeneralGeneral | Science | Subjects | Books
History of ScienceHistory of Science | History & Philosophy | Science | Subjects | Books
GeneticsGenetics | Basic Science | Medicine | Subjects | Books
BiochemistryBiochemistry | Bioengineering | Engineering | Professional & Technical | Subjects | Books
BiochemistryBiochemistry | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
GeneticsGenetics | Evolution | Professional Science | Professional & Technical | Subjects | Books
GeneticsGenetics | Basic Sciences | Medical | Professional & Technical | Subjects | Books
All DealsAll Deals | Blowout Books | Stores | Books
ScienceScience | Blowout Books | Stores | Books
All TitlesAll Titles | Qualifying Textbooks - Fall 2007 | Stores | Books
MedicineMedicine | Qualifying Textbooks - Fall 2007 | Stores | Books
ProfessionalProfessional | Qualifying Textbooks - Fall 2007 | Stores | Books
ScienceScience | Qualifying Textbooks - Fall 2007 | Stores | Books
Similar Items:
  1. Data Analysis Tools for DNA Microarrays Data Analysis Tools for DNA Microarrays
  2. DNA Microarrays and Gene Expression: From Experiments to Data Analysis and Modeling DNA Microarrays and Gene Expression: From Experiments to Data Analysis and Modeling
  3. Instant Notes in Bioinformatics (Instant Notes Series) Instant Notes in Bioinformatics (Instant Notes Series)
  4. Microarrays for an Integrative Genomics (Computational Molecular Biology) Microarrays for an Integrative Genomics (Computational Molecular Biology)
  5. Bioinformatics: Sequence and Genome Analysis Bioinformatics: Sequence and Genome Analysis

ASIN: 052152587X

Book Description

DNA microarrays have revolutionized molecular biology and are becoming a standard tool in the field. Dov Stekel's book is a comprehensive guide to the mathematics, statistics, and computing required to use microarrays successfully. Unlike traditional molecular biology, the successful use of DNA microarrays requires the application of statistics and computing to design the arrays and experiments, and to analyze and manage the data. This book is written for researchers, clinicians, and laboratory managers.

Download Description

This book is a comprehensive guide to all of the mathematics, statistics and computing you will need to successfully operate DNA microarray experiments. It is written for researchers, clinicians, laboratory heads and managers, from both biology and bioinformatics backgrounds, who work with, or who intend to work with microarrays. The book covers all aspects of microarray bioinformatics, giving you the tools to design arrays and experiments, to analyze your data, and to share your results with your organisation or with the international community. There are chapters covering sequence databases, oligonucleotide design, experimental design, image processing, normalisation, identifying differentially expressed genes, clustering, classification and data standards. The book is based on the highly successful Microarray Bioinformatics course at Oxford University, and therefore is ideally suited for teaching the subject at postgraduate or professional level.

Customer Reviews:

5 out of 5 stars Great Introduction to Microarray Analysis.......2006-05-12

This is an excellent introduction to microarray analysis. It is great at explaining the theory behind normalization, clustering, and dimensionality reduction without getting hung up on the statistics behind it. If you are looking for an exhaustive statistical treatment on the topic, this is not the book. But it will give you excellent background on these techniques that make reading statistical papers on the topic much easier for the non-statistics biologist.

Highly recommended.

4 out of 5 stars Neat little book on microarrays.......2006-03-24

Without question this short paperback is a nifty little text. What it does is provide the beginner with a basic brief overview in covering all major aspects of microarrays.

What you have to keep in mind is this book is intended for those who want a brief overview of all aspects of microarrays. Its a "forest for the trees" book on microarrays. The writing is very good and easy to follow, and its a great introductory text and reasonably priced.

Regardless of ones formal training, (e.g. Biology, Statistics, Computer Science, ... , health science) I think it would make an excellent little basic reference on ones bookshelf or to just have around in the lab for undergraduates/beginning graduate students.

Bottomline: If you prefer to learn things by starting at the start and not at the end then consider this book; Indeed its a great starter book to get your feet a little wet before jumping in over your head to the more gnarly stuff.

5 out of 5 stars If you are new to microarray, get this book........2005-05-16

This book describes basic concepts and procedures for those who are new to microarray. I'd recommend that a reader should use this book to grasp what microarray is. You won't be able to know anything in depth from this book but it will be nice to have this if you have trouble in understanding a more challenging book. Once you read this book, please go ahead and read another book since this book doesn't tell you everything about microarray. It's just a basic overview... i was glad that I used this book as my first microarray textbook....

4 out of 5 stars an intro. for biologists.......2004-09-08

This book is written clearly, which also means it doesn't touch too deep. I believe it's mainly useful for biologists who want to get a brief and application oriented introduction, but not for the researchers that want to improve the technology.

5 out of 5 stars A Good Book for Microarray Bioinformatics.......2004-01-04

I rate this book a 5 star because I believe this book is one of best bioinformatics books which make it possible for the biologists to understand the bioinformatic tools inside of microarray technology. For me the most useful chapters include Sequence Databases for Microarrays, Computer Design of Oligonucleotide Probes, Normalisation, Measuring and Quantifying Microarray Variability, Analysis of Differentially Expressed Genes. As a three-years microarray user, I still get a lot information after I read this book. However, no any bioinformatic books are perfect and complete. There are also some limitations in this book. The author sometimes did not provide detailed information on some biostatistic analysis tools and only provided some references for reading. Since a lot of bioinformatic tools are still in the trial stage and need to be improved, we can not blame the author for the incompleteness.
As a 250-pages bioinformatics book, I believe, this book is very informative and useful for microarray users and biologists who are tired of understanding the abstract biostatistic equations.
Statistical Methods in Bioinformatics
Average customer rating: 3.5 out of 5 stars
  • Misleading title!
  • Great all-around review of probability
  • Disappointing overview
  • Pretty good overview
  • guide into the right direction
Statistical Methods in Bioinformatics
Warren J. Ewens , and Gregory R. Grant
Manufacturer: Springer
ProductGroup: Book
Binding: Hardcover

GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
GeneticsGenetics | Evolution | Science | Subjects | Books
GeneralGeneral | Science | Subjects | Books
GeneralGeneral | Applied | Mathematics | Science | Subjects | Books
Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
GeneralGeneral | Research | Medicine | Subjects | Books
Communication & WritingCommunication & Writing | Reference | Medicine | Subjects | Books
GeneralGeneral | Biology | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
BiostatisticsBiostatistics | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
GeneralGeneral | Applied | Mathematics | Professional Science | Professional & Technical | Subjects | Books
StatisticsStatistics | Applied | Mathematics | Professional Science | Professional & Technical | Subjects | Books
ResearchResearch | Medical | Professional & Technical | Subjects | Books
BiologyBiology | Medicine | New & Used Textbooks | Stores | Books
Biostatistics & EpidemiologyBiostatistics & Epidemiology | Medicine | New & Used Textbooks | Stores | Books
GeneralGeneral | Medicine | New & Used Textbooks | Stores | Books
StatisticsStatistics | Mathematics | Sciences | New & Used Textbooks | Stores | Books
GeneralGeneral | Mathematics | Sciences | New & Used Textbooks | Stores | Books
GeneralGeneral | Communications | Humanities | New & Used Textbooks | Stores | Books
All TitlesAll Titles | Qualifying Textbooks - Fall 2007 | Stores | Books
MedicineMedicine | Qualifying Textbooks - Fall 2007 | Stores | Books
ProfessionalProfessional | Qualifying Textbooks - Fall 2007 | Stores | Books
ScienceScience | Qualifying Textbooks - Fall 2007 | Stores | Books
Accessories:
  1. Academic Scientists at Work Academic Scientists at Work
  2. The Science of Cooking The Science of Cooking
  3. Nanobiotechnology of Biomimetic Membranes (Fundamental Biomedical Technologies) Nanobiotechnology of Biomimetic Membranes (Fundamental Biomedical Technologies)

ASIN: 0387952292

Book Description

Advances in computers and biotechnology have had an immense impact on the biomedical fields, with broad consequences for humanity. Correspondingly, new areas of probability and statistics are being developed specifically to meet the needs of this area. There is now a necessity for a text that introduces probability and statistics in the bioinformatics context. This book also describes some of the main statistical applications in the field, including BLAST, gene finding, and evolutionary inference, much of which has not yet been summarized in an introductory textbook format. This book grew out of a need to teach bioinformatics to graduate students at the University of Pennsylvania. At the same time however, it is organized to appeal to a wider audience. In particular it should appeal to any biologist or computer scientist who wants to know more about the statistical methods of the field, as well as to a trained statistician who wishes to become involved in bioinformatics. The earlier chapters introduce the concepts of probability and statistics at an elementary level, and will be accessible to students who have only had introductory calculus and linear algebra. Later chapters are immediately accessible to the trained statistician. Only a basic understanding of biological concepts is assumed, and all concepts are explained when used or can be understood from the context. Several chapters contain material independent of that in other chapters, so that the reader interested in certain areas can proceed directly to those areas.

Warren Ewens is Professor of Biology at the University of Pennsylvania. He is the author of two books, Population Genetics and Mathematical Population Genetics, and has served on the editorial boards of Theoretical Population Biology, GENETICS, Proceeding of the Royal Society B and SIAM Journal in Mathematical Biology. He was recently awarded the Gold Medal of the Australian Statistical Society and elected as Fellow of the Royal Society. His research interests are in evolutionary population genetics, linkage analysis for human diseases, and bioinformatics.

Gregory Grant is a bioinformatics researcher at the University of Pennsylvania in the Computational Biology and Informatics Laboratory (CBIL), where he has been since 1998. In 1995 he received a Ph.D. in Mathematics from the University of Maryland and in 1999 a Masters in Computer Science from the University of Pennsylvania. His research interests are in bioinformatics in general and in particular in the statistical analysis of gene expression data and significance testing methods for IBD-mapping.

Customer Reviews:

2 out of 5 stars Misleading title!.......2004-12-12

A better title for this book would be 'How Blast works' because this book is centered around this topic. If you expect a general overview of statistical methods used in bioinformtics you should buy another book (e.g. Hastie, Baldi, Pevzner, Duda, Eddy which provide more general methods). If you want to know in mathematical detail how blast works, this is your book. I think the level is advanced and one needs some mathematical background to appreciate it (certainly not to recommend for biologists).

I don't think it is a really bad book but I think it gives a wrong impression of (statistical) methods in bioinformatics. Another reviewer wrote ...This is one of the books I have been waiting for. For a population geneticist who wants to learn bioinformatics, most texts are unacceptable: They present heuristic methods in a cookbook fashion, with little reference to what is going on biologically as well as mathematically....
This is exactly the problem with this book!! Bioinformatics is more machine learning than statistics and more heuristic then exact.

5 out of 5 stars Great all-around review of probability .......2004-08-17

The book's title says 'Statistical Methods', but all of statistics is derived from probability theory. That's really where Ewens and Grant start, with the best high-density review of probability I know.

The first two chapters cover probabilities of one and many variables, respectively. This includes several topics that other authors equently skip, including conditional and marginal probabilities, probability- and moment-generating functions, a little about entropy, distributions of sums, and extreme value statistics. All that takes about 100 pages. Two later chapters cover statistical inference (parameter estimation, hypothesis testing, and Bayesian techniques), two more cover stochastic processes including Markov models, a short chapter includes hidden Markov models and their training, and another chapter covers sampling techniques: bootstraps, permutation tests and such.

If the book contained only that material, it would still be a valuable review and summary of basic probability. It's way too dense to be a beginner's text. That's OK, those chapters were really intended as a review and as a statement of the terms and notation used in the book's real objectives: models of biological systems.

The chapters on biological applications are interspersed with chapters on basics, so that each application is presented as soon as its elements are covered. Those chapters describe statistical properties of a single DNA or protein string, relationships between two strings, BLAST and its scoring models, mutation modeling, and construction of phylogenetic trees. Coverage of each topic is brief but very dense. A surprising amount of information is packed into each brief chapter, and it's surprisingly readable. Still, these are big topics. Ewens and Grant don't and don't try to present any topic to its full depth. Instead, they give enough discussion that a determined reader can learn the basics, and can understand more advanced discussions of specific topics.

The book does require a determined reader with some background in probability - this shouldn't be anyone's first book, unless you have a very skilled teacher. The prepared and careful reader will be very well rewarded, however. Despite the book's title about statistics and bioinformatics, this is a reference you may use for probability models in any field. It's certainly one that I keep coming back to.

//wiredweird

2 out of 5 stars Disappointing overview.......2003-11-12

This book is a tremendous disappointment, given other Amazon reviews and the impressive Table of Contents. I picked several topics about which I know something: Likelihoods, P-values, bootstraps. I would have had NO idea about either of these subjects based on the poor delivery in this book. Topics are not well introduced, there are virtually no examples, and the introduction/discussion of most topics is wordy and not informative.

A topic such as the two-sample t-statistic is scattered throughout the book, with the main part not even cited in the index!

Unfortunately there are not a lot of books in the field of Statistics in Bioinformatics. However, I would recommend "The Elements of Statistical Learning" (Hastie et al.) for classifiers etc (Duda and Hart's classic is also good). I would recommend "Biostatistical Analysis" by Zar for a general coverage, and Terry Speed's "stat Labs: Mathematical Statistics ..." which is not comprehensive but has good lab examples with associated statistical analysis.

4 out of 5 stars Pretty good overview.......2002-09-19

This book is a timely introduction to the mathematical statistics used in computational biology and bioinformatics. The authors have done a superb job in the overview of a subject that students of biology and bioinformatics can rely on for study and for reference. The mathematics is done at an advanced undergraduate level, but the authors are pragmatic in their approach, and interlace the discussion with biological applications immediately after the appropriate mathematical background has been developed. It thus seems appropriate to discuss the quality of the presentation with these applications in mind.

Chapter one begins, appropriately, with an introduction to probability theory, with a consideration of discrete probability distributions of one variable beginning the chapter. The Bernoulli, binomial, uniform, geometric, generalized geometric, and Poisson distributions are discussed. The authors point out the use of geometric-like distributions in the BLAST application. The also caution the reader as to the difference between the mean and the average of a random variable. They then move on to consider continuous distributions, discussing briefly the uniform, Normal, exponential, gamma, and beta distributions. Moment-generating functions are also introduced, and they prove a "convexity" theorem for these functions that is important in the BLAST application. The authors also introduce the relative entropy and generalized support statistics, the later also being used in BLAST.

The next chapter is an overview of probability theory in many random variables. The results in chapter one are discussed in this context, and the authors give an interesting application to the sequencing of EST libraries. The authors also point out that the variance of the maximum of a collection random variables is finite as the number of variables increases, a fact that is used quite often in bioinformatics. Transformations of random variables are also discussed, with the goal of showing how these can be used to find the density function of a single random variable, this also being important in BLAST.

The most important subject of the book begins in chapter 3, wherein the authors introduce statistical inference. They begin with a very brief discussion of the differences between the frequentist and Bayesian approaches to statistical inference and then move on to classical hypothesis testing and nonparametric tests. This chapter is of great value to those readers, for example biologists/would-be bioinformaticists who are approaching statistics for the first time.

Chapter 4 introduces concepts that are of upmost importance in probabilistic computational biology, namely Markov chains. The discussion in this chapter sets up the strategies used in the next chapter on analyzing a single DNA sequence and a latter chapter on hidden Markov models. Shotgun sequencing is discussed as a tool to determine the an actual DNA sequence, and the authors discuss the probabilistic issues that arise in the reconstruction of long DNA sequences from shorter sequences. Missing in this chapter is a mathematical analysis of the advantages/disadvantages between shotgun and whole genome sequencing strategies.

Chapter 6 then generalizes the analysis of chapter 5 to multiple DNA and protein sequences. It is here that one begins to talk about alignments between sequences, which bring about some very subtle mathematical problems in computational biology. The computational complexity of the (global) alignment problem entails the use of softer techniques, such as dynamic programming, which is discussed in this chapter. The (local) alignment problem is also discussed in some detail, using the linear gap model. The alignment problem and the issues with scoring for protein sequences are also discussed in detail. The reader first encounters the famous PAM and BLOSUM matrices in this chapter. The authors do not discuss any connections with the protein folding problem, unfortunately.

The next chapter introduces the basic probability theory behind the BLAST algorithm, namely random walks. They do so with emphasis on moment generating functions, which might be a little abstract for the biologist reader.

The authors return to tatistical estimation and hypothesis testing in chapter 8, with maximum liklihood and fixed sample size tests discussed in some detail. Again connecting with the BLAST algorithm, the sequential probability ratio test is treated.

The authors finally get down to the BLAST algorithm in chapter 9, using an older version of the software (1.4). The connection of the algorithm with random walks and how to assign scores is immediately apparent, as is the ability of BLAST to do database queries against a chosen sequence. The algorithm is compared with the sequential analysis discussed in the last chapter.

The authors return to Markov chains in chapter 10, and give some numerical examples. In addition, they treat the important topic of Markov chain Monte Carlo via the Hastings-Metropolis algorithm, Gibbs sampling, and simulated annealing. An application of simulated annealing to the double digest problem is described. The authors also spend a litte time discussing continuous-time Markov chains.

Hidden Markov models are finally discussed in chapter 11. These have been the most effective tools in sequence analysis and the authors give a nice overview of their construction and properties in this chapter. The Pfam package is discussed as a software implementation of HMMs for determining protein domains. Unfortunately, they do not discuss the excellent package HMMER for implementing HMMs in sequence analysis.

Chapter 12 discusses computationally intensive methods in classical inference. One of these methods, the bootstrap procedure, which is used for large sample sizes, is described. Used to estimate confidence intervals in situations where there is not enough information to employ classical methods, the authors detail a method using quantiles to estimate the confidence interval for the standard deviation of the expression intensity of a gene. This is followed by a return to the multiple testing problem of chapter 3 in the context of the data analysis of expression arrays.

I did not read the last two chapters on evolutionary models and phylogenetic tree estimation so I will omit their review.

5 out of 5 stars guide into the right direction.......2001-09-06

This is one of the books I have been waiting for. For a population geneticist who wants to learn bioinformatics, most texts are unacceptable: They present heuristic methods in a cookbook fashion, with little reference to what is going on biologically as well as mathematically.

This book is the first exception I know of. It builds, and rests on, solid foundations of genetic stochastic processes and still goes all the way to real-life problems. Let me illustrate this by means of an example, rather than enumerating all the topics in the book.

Chap. 14, entitled `phylogenetic tree estimation' (as opposed to the more common term `phylogenetic tree reconstruction' - not without reason, I presume) builds on, and is firmly interlaced with, Chap. 13 about `evolutionary models', which systematizes the zoo (if not jungle) of substitution models in both discrete and continuous time. On this basis, the overview of tree-building methods makes a lot of sense. Even better, it does not stop here, but presents an application (to real sequence data), followed by a careful analysis of where the various methods agree, and where - and maybe why - they disagree. This way, it clears away some common misconceptions; in particular, it presents a careful analysis of what bootstrap does and what it does not in this context. The chapter closes with a discussion of unresolved problems (like inhomogeneity of substitution rates), and methods and possible pitfalls related to testing of nested and non-nested hypotheses in tree estimation.

The book is written in an informal style without being imprecise, which makes it pleasant reading. It is particularly suitable for teaching at a high level. This is enhanced by realistic (and even real-life) examples that furnish the text, as well as carefully chosen exercises at the end of each chapter.

Certainly, this first edition of `Statistical Methods in Bioinformatics' cannot be the last word in this fast-moving field. But it is an excellent guide into the `right' direction.
New Directions in Statistical Physics: Econophysics, Bioinformatics, and Pattern Recognition
Average customer rating: Not rated
    New Directions in Statistical Physics: Econophysics, Bioinformatics, and Pattern Recognition

    Manufacturer: Springer
    ProductGroup: Book
    Binding: Hardcover

    GeneralGeneral | Science | Subjects | Books
    Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
    GeneralGeneral | Physics | Science | Subjects | Books
    Mathematical PhysicsMathematical Physics | Physics | Science | Subjects | Books
    General & ReferenceGeneral & Reference | Technology | Science | Subjects | Books
    BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
    GeneralGeneral | Physics | Professional Science | Professional & Technical | Subjects | Books
    Mathematical PhysicsMathematical Physics | Physics | Professional Science | Professional & Technical | Subjects | Books
    All Amazon UpgradeAll Amazon Upgrade | Amazon Upgrade | Stores | Books
    Professional & TechnicalProfessional & Technical | Amazon Upgrade | Stores | Books
    ScienceScience | Amazon Upgrade | Stores | Books
    All TitlesAll Titles | Qualifying Textbooks - Fall 2007 | Stores | Books
    ProfessionalProfessional | Qualifying Textbooks - Fall 2007 | Stores | Books
    ScienceScience | Qualifying Textbooks - Fall 2007 | Stores | Books
    ASIN: 3540431829

    Book Description

    Statistical physics addresses the study and understanding of systems with many degrees of freedom. As such it has a rich and varied history, with applications to thermodynamics, magnetic phase transitions, and order/disorder transformations, to name just a few. However, the tools of statistical physics can be profitably used to investigate any system with a large number of components. Thus, recent years have seen these methods applied in many unexpected directions, three of which are the main focus of this volume. These applications have been remarkably successful and have enriched the financial, biological, and engineering literature. Although reported in the physics literature, the results tend to be scattered and the underlying unity of the field overlooked. This book provides a unique insight into the latest breakthroughs in a consistent manner, at a level accessible to undergraduates, yet with enough attention to the theory and computation to satisfy the professional researcher.
    Computational and Statistical Methods in Bioinformatics
    Average customer rating: Not rated
      Computational and Statistical Methods in Bioinformatics
      Xue-wen Chen , George C. Tseng , Xinkun Wang , and Ya Zhang
      Manufacturer: Chapman & Hall/CRC
      ProductGroup: Book
      Binding: Paperback

      BiotechnologyBiotechnology | Biological Sciences | Science | Subjects | Books
      GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
      BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
      GeneralGeneral | Science | Subjects | Books
      Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
      GeneralGeneral | Biology | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
      BiotechnologyBiotechnology | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
      StatisticsStatistics | Applied | Mathematics | Professional Science | Professional & Technical | Subjects | Books
      ASIN: 1420070541
      Hidden Markov Models for Bioinformatics (Computational Biology)
      Average customer rating: 3.5 out of 5 stars
      • Written by a mathematician for mathematicians
      • Good material, but you really have to want it.
      • Primarily for bio-mathematicians
      Hidden Markov Models for Bioinformatics (Computational Biology)
      T. Koski
      Manufacturer: Springer
      ProductGroup: Book
      Binding: Hardcover

      BiochemistryBiochemistry | Biological Sciences | Science | Subjects | Books
      BiotechnologyBiotechnology | Biological Sciences | Science | Subjects | Books
      GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
      BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
      GeneticsGenetics | Evolution | Science | Subjects | Books
      GeneralGeneral | Science | Subjects | Books
      GeneralGeneral | Mathematics | Science | Subjects | Books
      Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
      GeneralGeneral | Medicine | Subjects | Books
      BiotechnologyBiotechnology | Bioengineering | Engineering | Professional & Technical | Subjects | Books
      BiochemistryBiochemistry | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
      BiotechnologyBiotechnology | Biological Sciences | Professional Science | Professional & Technical | Subjects | Books
      GeneticsGenetics | Evolution | Professional Science | Professional & Technical | Subjects | Books
      StatisticsStatistics | Applied | Mathematics | Professional Science | Professional & Technical | Subjects | Books
      All TitlesAll Titles | Qualifying Textbooks - Fall 2007 | Stores | Books
      MedicineMedicine | Qualifying Textbooks - Fall 2007 | Stores | Books
      ProfessionalProfessional | Qualifying Textbooks - Fall 2007 | Stores | Books
      ScienceScience | Qualifying Textbooks - Fall 2007 | Stores | Books
      Similar Items:
      1. An Introduction to Bioinformatics Algorithms (Computational Molecular Biology) An Introduction to Bioinformatics Algorithms (Computational Molecular Biology)
      2. Statistical Methods in Bioinformatics: An Introduction (Statistics for Biology and Health) Statistical Methods in Bioinformatics: An Introduction (Statistics for Biology and Health)
      3. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
      4. Inference in Hidden Markov Models (Springer Series in Statistics) Inference in Hidden Markov Models (Springer Series in Statistics)
      5. Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology

      ASIN: 1402001355

      Book Description

      The purpose of this book is to give a thorough and systematic introduction to probabilistic modeling in bioinformatics. The book contains a mathematically strict and extensive presentation of the kind of probabilistic models that have turned out to be useful in genome analysis. Questions of parametric inference, selection between model families, and various architectures are treated. Several examples are given of known architectures (e.g., profile HMM) used in genome analysis.
      Audience: This book will be of interest to advanced undergraduate and graduate students with a fairly limited background in probability theory, but otherwise well trained in mathematics and already familiar with at least some of the techniques of algorithmic sequence analysis.

      Customer Reviews:

      2 out of 5 stars Written by a mathematician for mathematicians.......2004-03-11

      The intended audience of this book are mathematicians. To understand this book, you should have prior coursework experience in at least several upper division undergraduate courses in mathematical statistics and probability theory. The structure of this book is also that of a typical math book; full of proposition, corollary, lemma, etc, and very limited use of illustrations (e.g., there is no single figure up to chapter 6).

      I wanted a book with a mathematical sophistication simliar to Durbin's book, but this book is way more than that. On the other hand, I showed this book to a mathematics graduate student and she said this book is perfect for her. So I guess this book is written by a mathematician only for mathematicians.

      5 out of 5 stars Good material, but you really have to want it........2003-10-10

      The book gives outstanding coverage of all that goes into building HMMs - one of the most important tools in genome analysis and structure prediction. It covers the field in extreme depth. More depth, in fact, than needed for building useful HMM systems. It not only presents the forward and backward algorithms leading up to Baum-Welch, it presents all the extras - convergence, etc.

      This additional depth of coverage may go beyond many readers' needs. It is very helpful, though, for people who need more than the usual algorithms. By giving the background in such detail, a persistent reader can follow to a certain point, then create modifications with a clear idea of where the new algorithm actually comes from.

      Regarding the current practice of HMM usage, I found it a bit thin. Widely-known tools based on HMMs are mentioned only occasionally and in passing, and HMM-based alignment is discussed only briefly. Well, this book isn't for the tool user. Perhaps more important, I found scant mention of scoring with respect to some background probability model ("null" model, as it's called here).

      My one real complaint, and this is truly minor, is the quality of illustration. The line-drawings look like Word pictures - not necessarily a bad thing, if done well. These aren't particularly professional-looking, though, and oddly stretched or squashed in many cases. Still, they're readable enough and make all the needed points.

      A lesser point, and not the author's fault, is the editorial implication that this book introduces probabilitic models in general. It does not. This is strictly about HMMs, not Bayesian nets, bootstrap techniques, or any of the dozens of other probabilistic models used in bioinformatics. That is not a flaw of the book, just a flaw in how it's represented.

      If you are dedicated to becoming an expert in HMM construction and application, you must have this book. It's a bit much, though, for people who just want the results that HMMs give.

      4 out of 5 stars Primarily for bio-mathematicians.......2003-07-01

      The field of computational biology has expanded greatly in the last decade, mainly due to the increasing role of bioinformatics in the genome sequencing projects. This book outlines a particular set of algorithms called hidden Markov models, that are used frequently in genetic sequence search routines. The book is primarily for mathematicians who want to move into bioinformatics, but it could be read by a biologist who has a strong mathematical background. The book is detailed at some places, sparse in others, and reads like a literature survey at times, but many references are given, and there are very interesting exercises at the end of each chapter section. In fact it is really imperative that the reader work some of these exercises, as the author proves some of the results in the main body of the text via the exercises.

      Some of the highlights of the book include: 1. An overview of the probability theory to be used in the book. The material is fairly standard, including a review of continuous and discrete random variables, from the measure-theoretic point of view, i.e the author introduces them via a probability space which is set with its sigma field, and a probability measure on this field. The weight matrix or "profile" as it is sometimes called, is defined, this having many applications in bioinformatics. Bayesian learning is also discussed, and the author introduces what he calls the "missing information principle", and is fundamental to the probabilistic modeling of biological sequences. Applications of probability theory to DNA analysis are discussed, including shotgun assembly and the distribution of fragment lengths from restriction digests. A collection of interesting exercises is included at the end of the chapter, particularly the one on the null model for pairwise alignments. 2. An introduction to information theory and the relative entropy or "Kullback distance", the latter of which is used to learn sequence models from data. The author defines the mutual information between two probability distributions and the entropy, and calculates the latter for random DNA. He also proves some of the Shannon source coding theorems, one being the convergence to the entropy for independent, identically distributed random variables. The Kullback distance is then defined, as a distance between probability distributions, with the caution that it is not a metric because of lack of symmetry. 3. The overview of probabilistic learning theory, where 'learning from data' is defined as the process of inferring a general principle from observations of instances. 4. The very detailed treatment of the EM algorithm, including the discussion of a model for fragments with motifs. 5. The discussion of alignment and scoring, especially that of global similarity. Local alignment is treated in the exercises. 6. The discussion of the learning of Markov chains via Bayesian modeling applied to a training sequence via a family of Markov models. Frame dependent Markov chains are discussed in the context of Markovian models for DNA sequences. 7. The discussion of influence diagrams and nonstandard hidden Markov models, in particular the excellent diagrams drawn to illustrate the main properties, and excellent discussion is given of an "HMM with duration" in the context of the functional units of a eukaryotic gene. This is important in the GeneMark:hmm software available. 8. The treatment of motif-based HMM, in particular the discussion of the approximate common substring problem. 9. The discussion of the "quasi-stationary" property of some chains and the connection with the "Yaglom limit". 10. The treatment of Derin's formula for the smoothing posterior probability of a standard HMM. The author shows in detail that the probability of a finite length emitted sequence conditioned on a state sequence of the HMM depends only on a subsequence of the state sequence. 11. The treatment of the lumping of Markov chains, i.e. the question as to whether a function of a Markov chain is another Markov chain. 12. The very detailed treatment of the Forward-Backward algorithm and the Viterbi algorithm. 13. The discussion of the learning problem via the quasi-log likelihood function for HMM. 14. The discussion of the limit points for the Baum-Welch algorithm. Since the Baum-Welch algorithm deals with iterations of a map, its convergence can be proved by finding the fixed points of this map. These fixed points are in fact the stationary points of the likelihood function and can be related to the convergence of the algorithm via the Zangwill theory of algorithms. Unfortunately the author does not give the details of the Zangwill theory, but instead delegates it to the references (via an exercise). The Zangwill theory can be discussed in the context of nonlinear programming, with generalizations of it occurring in the field of nonlinear functional analysis. It might be interesting to investigate whether the properties of hidden Markov models, especially their rigorous statistical properties, can all be discussed in the context of nonlinear functional analysis.
      Medical Data Analysis: Third International Symposium, ISMDA 2002, Rome, Italy, October 8-11, 2002, Proceedings (Lecture Notes in Computer Science)
      Average customer rating: Not rated
        Medical Data Analysis: Third International Symposium, ISMDA 2002, Rome, Italy, October 8-11, 2002, Proceedings (Lecture Notes in Computer Science)

        Manufacturer: Springer
        ProductGroup: Book
        Binding: Paperback

        GeneralGeneral | Administration & Policy | Medicine | Subjects | Books
        Basic ScienceBasic Science | Medicine | Subjects | Books | Anatomy | Biochemistry | Embryology | General | Genetics | Histology | Immunology | Microbiology | Nosology | Pathophysiology | Physiology
        GeneralGeneral | Medicine | Subjects | Books
        GeneralGeneral | Research | Medicine | Subjects | Books
        EpidemiologyEpidemiology | Infectious Disease | Internal Medicine | Medicine | Subjects | Books
        GeneralGeneral | Administration & Medicine Economics | Medical | Professional & Technical | Subjects | Books
        ResearchResearch | Medical | Professional & Technical | Subjects | Books
        Medical InformaticsMedical Informatics | Medical | Professional & Technical | Subjects | Books
        Computer MathematicsComputer Mathematics | Artificial Intelligence | Computer Science | Computers & Internet | Subjects | Books
        GeneralGeneral | Computers & Internet | Subjects | Books
        GeneralGeneral | Databases | Computers & Internet | Subjects | Books
        BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
        ASIN: 3540000445

        Book Description

        This book constitutes the refereed proceedings of the Third International Symposium on Medical Data Analysis, ISMDA 2002, held in Rome, Italy, in October 2002. The 23 revised full papers presented were carefully reviewed and selected for inclusion in the book. The papers are organized in topical sections on data mining and decision support systems, medical informatics and modeling, time series analysis, and medical imaging.
        Statistical Advances in the Biomedical Sciences: Clinical Trials, Epidemiology, Survival Analysis, and Bioinformatics (Wiley Series in Probability and Statistics)
        Average customer rating: Not rated
          Statistical Advances in the Biomedical Sciences: Clinical Trials, Epidemiology, Survival Analysis, and Bioinformatics (Wiley Series in Probability and Statistics)

          Manufacturer: Wiley-Interscience
          ProductGroup: Book
          Binding: Hardcover

          GeneralGeneral | Science | Subjects | Books
          Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
          BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
          GeneralGeneral | Medicine | Subjects | Books
          GeneralGeneral | Research | Medicine | Subjects | Books
          EpidemiologyEpidemiology | Infectious Disease | Internal Medicine | Medicine | Subjects | Books
          Biomedical EngineeringBiomedical Engineering | Bioengineering | Engineering | Professional & Technical | Subjects | Books
          StatisticsStatistics | Applied | Mathematics | Professional Science | Professional & Technical | Subjects | Books
          ResearchResearch | Medical | Professional & Technical | Subjects | Books
          ASIN: 0471947539

          Book Description

          A number of eminent experts on Clinical Trials, Epidemiology, Survival Analysis, and Genomics/Proteomics have contributed 30 carefully prepared and peer-reviewed articles to this book. Within the four sections, the articles have been organized so as to make the thematic transition between them as smooth as possible. A structural uniformity is maintained across all the chapters, each starting with an introduction that discusses the general concepts and describes the biomedical problem under focus.
          Statistical Bioinformatics: For Biomedical And Life Science Researchers (Methods of Biochemical Analysis)
          Average customer rating: Not rated
            Statistical Bioinformatics: For Biomedical And Life Science Researchers (Methods of Biochemical Analysis)
            Jae K. Lee
            Manufacturer: John Wiley & Sons Inc
            ProductGroup: Book
            Binding: Paperback

            GeneralGeneral | Biology | Biological Sciences | Science | Subjects | Books
            BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
            GeneticsGenetics | Evolution | Science | Subjects | Books
            Probability & StatisticsProbability & Statistics | Applied | Mathematics | Science | Subjects | Books
            ImmunologyImmunology | Basic Science | Medicine | Subjects | Books
            GeneralGeneral | Medicine | Subjects | Books
            ASIN: 0471692727
            Wavelet methods and statistical applications: Network security and bioinformatics : (Dissertation)
            Average customer rating: Not rated
              Wavelet methods and statistical applications: Network security and bioinformatics : (Dissertation)
              Deukwoo Kwon
              Manufacturer: ProQuest Information and Learning
              ProductGroup: Book
              Binding: Digital

              Network SecurityNetwork Security | Networking | Computers & Internet | Subjects | Books
              GeneralGeneral | Science | Subjects | Books
              BioinformaticsBioinformatics | Biological Sciences | Science | Subjects | Books
              Science & TechnologyScience & Technology | Subjects | e-Docs | Formats | Books
              Science & TechnologyScience & Technology | PDF (printable) | Formats | e-Docs | Formats | Books
              ASIN: B000F6I6IC
              Release Date: 2006-03-28

              Book Description

              Citation Details


              Distributed by ProQuest Information and Learning

              Books:

              1. Student Study Guide to accompany Microbiology
              2. Survival of the Sickest: A Medical Maverick Discovers Why We Need Disease
              3. The American Woodland Garden: Capturing the Spirit of the Deciduous Forest
              4. The Botany of Desire: A Plant's-Eye View of the World
              5. The Cherry Blossom Festival: Sakura Celebration
              6. The Coming Plague: Newly Emerging Diseases in a World Out of Balance
              7. The Demon in the Freezer : A True Story
              8. The Developing Person Through the Life Span (paper)
              9. The Elephant's Secret Sense: The Hidden Life of the Wild Herds of Africa
              10. The End of Days: Armageddon and Prophecies of the Return (The Earth Chronicles)

              Books Index

              Books Home

              Recommended Books

              1. Red Moon Rising: How 24-7 Prayer is Awakening a Generation
              2. How to Become an Alpha Male
              3. Killer Curves
              4. Fatal Voyage : A Novel
              5. Lee Frost's Simple Art of Black and White Photography: Easy Methods for Making Fine Art Prints
              6. Laboratory Manual to accompany Biology
              7. I Will Fight No More Forever: Chief Joseph and the Nez Perce War
              8. The American Neptune, Pictorial Supplement VII : A Selection of Marine Paintings By Fitz Hugh Lane,
              9. Geology of Carlsbad Cavern and other caves in the Guadalupe Mountains, New Mexico and Texas
              10. Families of flowering plants in Ethiopia;: An introduction with keys for the identification of the f