Following the scientific literature: A practical guide for (not just computational) biologists

by Igor Ulitsky (Updated 03/2018)

The goal of this guide is to describe why, when, where and how can you follow the most up-to-date science of interest and what papers/journals you should follow. The guide is biased towards the fields of genomics/RNA/systems biology, but all the practical advice and the guidelines for the upper tier journals and top computational biology (compbio) journals apply also if you're into other topics. This guide also assumes that you're passionate about biology and that your fields of scientific interest are not very narrow.

Why should I constantly follow scientific literature?

There are simple pros and cons to devoting time to regularly getting updated on what's going on in other labs worldwide:

Cons

  • It takes time. How much time? Depends on how wide (how many fields/journals will you follow) and how deep (how many papers in each field) you want to be.
  • Most of the papers you will read will be eventually useless, as they will either be very poor, or you will never use the information you read.

The good news about these problems is that the amount of time it will take you to (a) decide if a paper is good; or (b) read a paper properly; drops dramatically over time, particularly if the paper is in the field you're working on (probably about 5- to 10-fold decrease in time spent per "candidate" paper over a course of a PhD).

Pros

  • The more you read – the more knowledgeable you become about (a) the problem you're working in your current project; (b) the frontiers of science that you should work on, and may form your next project; (c) the current status of science in general, approaches used, common difficulties etc. Thus, being up-to-date will save you a lot of time that you may waste:
    • working on problems already solved
    • thinking about problems that are solved/ will be solved very soon by others
    • (If you're a computational biologist) writing your own code for problems already solved by tools available of-the-shelf
  • Having a good grasp of your field is essential for writing fellowship requests, paper introductions, grant proposals etc. (as referring up-to-date studies is very important)
  • The more you read, the better the flow of your research becomes, the easier it is going to be to write your own paper(s). For example, the ability to initiate and carry out a proper computational biology study (choosing a problem, solving it, doing controls, finding new biology), is dramatically improving if you read papers about projects with similar structure (even if the problem they solve is entirely different).
  • Reading a lot of scientific papers improve your critical judgment– you will become much more critical about both the work of others and about your own (which is always a good thing).
  • (If you follow Science and Nature closely and read things not in your field). You will read about plenty of interesting stuff about neuroscience/education/behavior, that do not make it to regular newspapers (monkeys count better than graduate students, people holding hot coffee cups are more likely to be nice to you etc.)

When should I get updated / read?

The short answer is all the time. Doing a routine update, e.g., once you finish a project, or once a month, is something that sounds appealing, but is impossible in practice. You will probably never get to it (there is always something that seems more urgent than spending 2 weeks just reading papers), and if you do find the time, you would encounter too many papers at once, and miss most of the really useful material, nothing will 'sink in' and you will eventually miss out on most of the Pros described above. Therefore, it is best to screen for interesting stuff and read what passes your 'interesting' filter all the time (i.e., at least once a week). If you're really flooded with work, still mark/print the paper, and get back to it when you have time (The "Save for later" feature in Feedly is particularly useful).

Where to get updated?

There are three main places where you can get updates from:

  • PubMed – where you can get updates both based on specific keywords, by journal or by a specific author (e.g., the head of a lab that you know is doing stuff similar to yours).
  • Journal publisher's websites – usually these get updated both when a new journal issue is out and when new 'advance online' papers become available (it sometimes takes months before these find their way into an issue/print)
  • Science news websites, such as GenomeWeb, news@nature.
  • Blogs/Podcasts

I will focus below on how to get updates from the first two sources, but the methods (i.e., RSS) apply to the third as well.

How to get updated?

There are three simple methods for regularly receiving updates about papers being published. The choice of method depends on your personal preference. The best is probably to use a mixture of these methods, as each has different strengths, and are some journal-specific issues (see table below). The three technical ways of getting updates are:

  1. RSS – the best (in my opinion) option to get updates from most journals and from PubMed. Background on RSS is found here. The general idea is that you need to use an RSS reader, which can be either a standalone software (most e-mail software support also RSS reading) or web-based such as Feedly that I'm using (http://www.feedly.com/).  A feed is basically an XML file that contains some info. The RSS reader regularly checks this XML file and if something changes there – updates the info in your reader. Thus, by hourly/daily/weekly browsing through your unread items you can efficiently get all the updates from a large number of websites. You can also mark items that interest you ("Save for later" in Feedly), easily e-mail an item etc. Marking items that interest you is particularly useful if you want to mark an interesting paper to read it later (e.g., because the printer doesn't work, because the university forgot to pay someone or because a formatted PDF of the paper is not yet available). Importantly, as the usage of RSS is very widespread on the web, and you can use RSS to follow other interesting stuff: news (e.g., t-shirts from Threadless), blogs, the stock market, craigslist etc.
  2. E-mail updates –also can be used to get updates about specific keywords (from PubMed) or from specific journals (Table of Contents, or TOC alerts). The e-mail updates have some cons compared to RSS:
    1. Your Inbox gets cluttered
    2. Each e-mail will usually contain 10s of papers, out of which only one is really interest for you. If you want to mark it (e.g., to read later), share it or e-mail it to someone else, it is more difficult to do so.
    3. In most journals, it is possible only to get e-mail updates about the regular issues, and not about 'advance online' papers.
  3. Visiting the publisher website once a week/2 weeks/month – this option is difficult to stick to and is not recommended. If you do, note that  Nature publishing journals usually put out new papers on Mondays, and Cell Press on Thursdays (both around noon CET). Science also comes out on Thursdays around noon.
  4. Joining Twitter and following people active in your general area of research. This option is increasingly useful because of the increasing use of BioRxiv, which in my experience so far is difficult to follow using the other means. If you're into RNA, one particularly useful user to follow is RNApreprints.

Using these three options are there two main methods for getting updates:

  1. Get updates about every new paper coming out in journal X: This is the best option for the top journals or journals specifically in your main topic of interest. Even if the paper is not directly what you're looking for, it is possible that it is relevant or can ignite your imagination. This can be done by either adding an RSS feed from the journal (listed below, or from the web) or signing up for an e-mail alerts on the journal's homepage (all proper journals have this option these days, but it doesn't always work too reliable).
  2. Get updates about every paper about subject Y: This is a good option if you want to be sure you read everything about a specific topic (lncRNAs, microRNA function, protein interaction networks, p53, chip-seq etc.). This is recommended, as some papers relevant to you may appear in good biomedical journals that are too tedious to follow, as they very rarely publish stuff directly relevant to your work( e.g., NEJM, JAMA, Genetics, Oncogene etc.) The way to do this is by performing a search for your keyword in PubMed, then pressing the RSS button. This will result in a daily digest that you will get as soon as some papers with this keyword are added to the PubMed index.
  3. Get updates about papers with a particular author on them. Can also be done through a PubMed search.

 

Put effort into your query! Otherwise you will get a lot of papers in foreign languages. Take into account that no matter how hard you try, if you search by a common keyword (e.g., microRNA function), >80% of the papers will be coming from very (very) small journals and will probably be irrelevant. On the other hand, getting such updates will usually make sure you're not missing any publication relevant to you, and pressing "j" on Feedly you can skim through them very quickly.

 

What journals should I follow?

This is the most complex part to answer. On the one hand you should follow journals that publish things that interest you, but on the other hand, reading some other journals will expand you fields of interest. The answer is probably that you should follow each of Cell/Science/Nature/PLoS Biology, as these are the top biology journals, which publish almost entirely excellent science. You should try and read 1-2 papers from each of these every month or so (most will not be in your direct field, just pick ones that you think could be interesting).  If you're into genomics/systems biology, you must also follow The 'top layer of genetics/genomics' listed below. Once you see you can handle this volume – add to your RSSs/E-mails the other journals of genomics/genetics listed below.

Section

Journal (hyperlinked)

Current issue feed

New articles feed

Volume

Notes

Science / Biology in general:

Cell

RSS

 

Bi-weekly. Few papers but (almost) all describe landmark studies.

RSS feed doesn't really work. Check in every Thursday 

Science

RSS

RSS

Weekly. Only 5-10 papers per week are molecular biology papers, only about 0-1 per month are compbio.

E-mail table of contents are recommended for Nature & Science, since the News and Correspondence sections are also quite interesting sometimes

Nature

RSS

RSS

Weekly. 10-15 papers on molecular biology

PLoS Biology

RSS

 

A bi-weekly digest of papers is sent. 5-6 weekly papers on molecular biology. 

 

Top layer of Genetics / Genomics

Nature Genetics

RSS

RSS

Monthly

 

Nature Biotechnology

RSS

RSS

Monthly, ~2-3 papers are compbio relevant

 

Molecular Cell

RSS

Bi-weekly

 

Genome Research

RSS

RSS

Monthly, most is compbio relevant

 

PNAS

RSS

RSS

Weekly, 4-5 papers per week are compbio relevant

PNAS contains different articles from very different fields of science, some of which will be more and some less relevant to you. Therefore, it is better to subscribe to the e-mail alerts, in which the table of contents is broken into sections (and then just read the parts of the table of contents that interest you).

Molecular Systems Biology

RSS

Bi-weekly, mostly compbio-relevant

 

eLife

 

RSS

Papers come out as they are published

 

The EMBO Journal

RSS

RSS

 

 

Reviews

Nature Reviews Genetics

RSS

Monthly

 

Nature Reviews Molecular Cell Biology

RSS

Monthly

 

Genetic & Genomics - optional

PLoS Genetics

RSS

A weekly digest of papers, few compbio-relevant

Mostly papers on genetics, but some are of more broad interest

Trends in Genetics

RSS

Monthly

Most papers are 'review-like', while other describe small 'peculiarities' found, that are usually difficult to explain mechanistically, and sometimes are very interesting

Journal of Biology

RSS

Very rare

The top layer of the BMC journals. Very few papers, but frequently very interesting ones

Genome Biology

RSS

Not very systematic, you get updates occasionally. Many compbio paper

BMC journals publish papers first in an unformatted way- 50+ pages PDFs, that are quite unreadable and not environmentally friendly. It is usually better to wait 4-5 weeks before printing/reading the paper ("Save for later" is very useful here)

BMC Genomics

RSS

Genes & Development

RSS

RSS

Monthly

 

Nucleic Acids Research

RSS

Monthly

Publishes a very large number of papers (40-50). Usually ab. 5 are compbio-relevant, particularly if you're into sequence motifs or RNA

RNA

RSS

RSS

 

 

Core of CompBio journals

PLoS Computational Biology

RSS

A weekly digest of papers

 

Bioinformatics

RSS

Monthly

The e-mail table of contents alerts are broken into sections, making them easier to read

Journal of Computational Biology

RSS

Monthly

More computationally-oriented than the others

Additional optional CompBio journals

BMC bioinformatics

RSS

Not very systematic, you get updates almost daily

Very large volume

PLoS ONE (Computational Biology section)

RSS

A weekly digest

Similar to BMC bioinformatics, but even fewer gems

Briefings in Bioinformatics

RSS

1-3 per month

Reviews

Misc. journals

BMC Systems Biology

RSS

Not very systematic, you get updates occasionally

More focused on the biophysical side of systems biology

Cell Stem Cell

RSS

 

Top resource on stem cell biology

 

The RSS feeds on all Nature journals: http://www.nature.com/webfeeds/index.html

What should I avoid?

A very common scenario is that one decides to start following journals, subscribes to too many RSS feeds/e-mail alerts, gets flooded with too many updates to follow, and abandons following literature altogether.  On the other hand, if you subscribe to just 1-2 journals and use RSS, you will rarely get any updates, which will probably cause you to forget/stop operating the RSS reader. The best strategy is therefore to subscribe to a PubMed feed(s) with specific keyword relevant to your research (resulting in about 10-20 updates per week) + 5-10 top journals (e.g., Cell/Science/Nature/Nature Genetics/Genome Research/PLoS Biology/PLoS Computational Biology/Bioinformatics). If you see that you can handle the inflow of updates and read some papers for about 2 months, and still have appetite for some more science – gradually increase the number of keywords/journals you follow.

Questions? igor.ulitsky@weizmann.ac.il