1000 Genomes Project Publishes Analysis of Completed Pilot Phase

Small genetic differences between individuals help explain why some people have a higher risk than others for developing illnesses such as diabetes or cancer. Recently in the journal Nature, the 1000 Genomes Project, an international public-private consortium, published the most comprehensive map of these genetic differences, called variations, estimated to contain approximately 95 percent of the genetic variation of any person on Earth.

NIH-Funded Study Unveils Potential Genetic Links to Lung Disease Risk

A new study involving data from more than 20,000 individuals has uncovered several DNA sequences linked to impaired pulmonary function. The research, an analysis that combined the results of several smaller studies, provides insight into the mechanisms involved in reaching full lung capacity. The findings may ultimately lead to better understanding of lung function and diseases like asthma and chronic obstructive pulmonary disease (COPD), the fourth leading cause of death in the United States.

Chromosome Telomeres and the Nobel Prize for Medicine

nobel medal in medicineThe 2009 Nobel Prize in Physiology or Medicine was announced earlier this week. The prize was awarded to three U.S. scientists for the discovery of how chromosomes are protected by telomeres and the enzyme telomerase.

Two women, Elizabeth H. Blackburn, age 61, at the University of California in San Francisco, and Carol W. Greider, age 48, at Johns Hopkins University School of Medicine in Baltimore along with one man, Jack W. Szostak, age 57, at Harvard Medical School, will share the $1.4 million prize.

Core Biomedical Research Software and Web 2.0 Tools

Following a discussion on Twitter, Dr. Chris blogged about essential medical software and invited others to share their “core apps”. I thought I would add to the discussion with my own essential programs, albeit with a genomics/pre-clinical research bias.

First: a little about me. I’m a bioinformatician working in neuro-oncology.

Bioinformatician: an individual that uses information technology as applied to the life sciences, especially the technologies used for the collection, storage and retrieval of genomic data.

Last month, an article debating Scientist vs Developer in Bioinformatics was posted on Web 2.0 and Semantic Web for Bioinformaticians. I would fall into the first category (Scientist Bioinformatician), as I’m interested in testing hypotheses and discovering new knowledge.

Michael Barton wrote an entertaining article on career paths for Bioinformaticians earlier this year. In the World of Bioinformatics Quest, my character class is a Wet lab Bioinformatician (WB):

A dry lab scientist, working in wet lab, the WB is the support for all things computational biology in the laboratory. The WB is familiar enough with Perl to create simple scripts, but the real strength of the WB doesn’t lay in writing code, but rather in using tools. While most bioinformaticians are familiar, in theory, with how to find a gene of interest, the WB uses these tools day in and day out — the WB puts many to shame on basic bioinformatics. While others have their head in the clouds thinking about theories and algorithms, the WB is getting his hands dirty with real data as it is being produced.

The majority of my time is spent performing Functional and Comparative Genomics on DNA microarray data. The information I analyze is used to further our understanding of the molecular pathways altered in neurofibromatosis and to identify potential therapeutic targets.

Note that I’m an Apple user (MacBook 2.16 GHz Intel Core 2 Duo, 3 GB 667 Mhz DDR2 SDRAM), so all my core apps are for a Mac. With all of the visual data I manipulate, a good monitor is essential; I use a Samsung Syncmaster 245BW 24″ LCD monitor. Finishing out the hardware list is an Apple wireless keyboard and mouse.

Local Mac applications

GeneSpring GX

GeneSpring GX provides powerful, accessible statistical tools for fast visualization and analysis of gene expression data. Sure, there are more powerful alternatives (I hear all you R users grumbling …). Nevertheless, the ability to quickly manipulate gene lists and display gene expression patterns makes GeneSpring GX an essential core app for expression analysis.

R/Bioconductor

I use R in conjunction with Bioconductor, an open source and open development software project for the analysis and comprehension of genomic data, principally for data normalization prior to import and analysis in GeneSpring. I routinely use custom CDF ([Gene] Chip Definition File) libraries, so R/Bioconductor is absolutely essential for data preprocessing.

Microsoft Excel

Genomics analysis requires manipulation of many, many data points (100 samples x 56,000 probe sets = 5.6 million data points). Sorting, searching and organizing the data can all be accomplished with Microsoft Excel. A necessary evil.

Skim

Skim is a PDF reader and note-taker for OS X. It is designed to help you read and annotate scientific papers in PDF, but is also great for viewing any PDF file (think Preview on steroids). Unlike Adobe Acrobat, it’s FAST.

Bookends

I’ve tried a number of applications to organize and tag scientific journal articles. Keeping up is an impossible task. I’ve tried several applications (yes, I’ve even tried Papers). However, for those of us that write research articles for publication, a reference manager is an essential app. Bookends is bibliography software that can also organize PDFs. The interface isn’t flashy but it’s a powerful piece of software. You can perform PubMed searches, retrieve references and associated PFDs and create static, smart and virtual groups to organize and find information in your personal collection. And it’s FAST. Bookends uses Spotlight to search within PDFs. You can search titles, abstracts, authors, journals, dates and more. Bookends is actively supported and has a useful user forum.

Microsoft Word

Speaking of writing research articles for publication, Microsoft Word is another necessary evil. I’m writing all the time and a powerful word processor is a must.

Microsoft PowerPoint

Yes, another Microsoft app. Scientists are always talking about their research and pictures speak louder than words. Especially in my position as a Bioinformatician, I need software that allows me to display and present information; Microsoft PowerPoint is that application.

I should pause here and mention out that I’m testing Apple iWork to replace Word and PowerPoint. The entire Office suite isn’t optimized for OS X (it really runs poorly) and both Pages and Keynote — iWork apps — rival the utility of Word and PowerPoint.

Web applications/databases

I spend a great deal of time analyzing large high-throughput gene expression data sets. Each analysis involves addressing several hypotheses and establishing biological meaning from the data. I rely on a number of web applications and databases to identify genes or gene sets associated with a particular phenotype (meaning an observable characteristic) and to identify enriched biological themes, particularly gene ontology (GO) terms.

PubMed

Efficient literature search is essential to the practice of Evidence-Based Medicine and to Biomedical Research. PubMed provides free access to MEDLINE, one of the largest searchable databases of biomedical research articles. An absolute necessity.

DAVID Bioinformatics Resources 2008

The Database for Annotation, Visualization and Integrated Discovery (DAVID) 2008 provides a comprehensive set of functional annotation tools for researchers to understand biological meaning behind large list of genes. I’ve tried various other tools for functional annotation analysis, but I find DAVID the most comprehensive app on the web. DAVID is owned by the NIH and is under continuous development and improvement.

OMIM

OMIM (Online Mendelian Inheritance in Man) is a comprehensive and authoritative compendium of human genes and genetic phenotypes. The full-text, referenced overviews in OMIM contain information on all known mendelian disorders and over 12,000 genes. OMIM focuses on the relationship between phenotype and genotype. I frequently link to OMIM for genes discussed on Highlight HEALTH.

iHop

No, not the restaurant. Information Hyperlinked over Proteins (iHop) provides the network of genes and proteins as a natural way of accessing the millions of abstracts in PubMed. By employing genes and proteins as hyperlinks between sentences and abstracts, the information in PubMed becomes bound together into one navigable resource.

Web 2.0 tools

I’m using Web 2.0 tools more and more in my daily workflow. Many have become indispensable for organization and communication.

WordPress

Highlight HEALTH runs on WordPress. I’ve used WordPress for a couple of years now and (between plugins and hacks) it does everything I need it to do.

Delicious

I gave up trying to manage browser-based bookmarks some time ago. Delicious.com allows users to tag, save, manage and share web pages from a centralized source. The latest addition to my workflow is the use of the “toread” tag to find bookmarked articles of interest for later review (since emailing or saving links to the desktop is inefficient).

FreshReader

I use RSS frequently, not only for blogs of interest but also to keep up with biomedical journals (current issue table of contents RSS), PubMed searches (to keep up with the latest published research in my field of work), LinkedIn updates and news. I try not to depend on Google too much, although that can be difficult nowadays. For RSS, I use FreshReader, a web-based, server type RSS and Atom feed reader. Fed up with slow feed readers? FreshReader is FAST.

FriendFeed

FriendFeed is becoming an excellent social networking tool, since several rooms have been setup for life science researchers and programmers. In addition to being a social aggregator, FriendFeed has comment streams that allow you to contribute to the conversation. In many cases, entire discussions are held entirely on FriendFeed.

Twitter

Twitter is also becoming a useful tool to communicate with other doctors and researchers (most are also using FriendFeed as mentioned above). Twitter allows you to keep up with people that share similar interests.

If you use both both FriendFeed and Twitter, I’ve found a convenient tool. MySocialChatter.com brings FriendFeed and Twitter together in a single web page using frames and automatically refreshes every few minutes. It’s quite useful for keeping an eye on the conversation, is easy to read (compared to some desktop apps) and is non-disruptive.

… And the rest that I couldn’t live without

Other everyday applications that I couldn’t live without:

Quicksilver

Act without doing. If you use a Mac, Quicksilver is simply a must have.

Expose

Move over command-tab (that’s alt-tab for all you windows users), Expose is AWESOME.

Mail.app

Mail.app, my e-mail program of choice.

Safari

The web, Apple style. Safari with the Saft extension is my internet browser of choice. OS X optimized and FAST.

Journler

My digital notepad. Journler is a daily notebook and entry based information manager. Never again do I have to deal with miscellaneous text files scattered across my hard drive or random bits of text emailed to myself. That said however, I am looking into Evernote to supplement (and possibly replace) Journler.

iTunes

My iTunes plays mostly ambient, new age music. I frequently work with Groove Salad or Radio IO Ambient playing in the background.

Final thoughts

So, there you go. There’s many applications, but since I’m a bioinformatician that should come as no surprise. I think you’ll get a better idea of who I am and what I do from the software tools I use and rely on everyday. These are my core biomedical research apps.

Are you a physician or biomedical scientist? Is there software or Web 2.0 tools that you couldn’t live without? What are your core apps? Share in the comments below or write your own post and link back here.

Gene Genie #32 – Googling the Genie

Welcome to the 32nd edition of Gene Genie, a blog carnival devoted to genes and genetic conditions. This edition includes some excellent articles on genes and gene-related diseases, genetics, genomics and personalized genetics.

Google Health launched publicly this week and to recognize the event, the last section of the carnival is devoted to articles specifically about the service. Google, financial backer of 23andMe, also funds the Personal Genome Project, which plans to unlock the secrets of common diseases by decoding the DNA of 100,000 people in the world’s biggest gene sequencing project [1]. With the vast number of genetic data points collected for each genome sequenced, a digital system for the movement and storage of personal health information is critical for the widespread use of individualized healthcare. Google’s entrance into the online personal health records market may thus help to accelerate the era of personalized medicine.

With these thoughts in mind, let’s get to this month’s edition of the Genie.