Maker: An Easy-to-use Annotation Pipeline Designed For Emerging Model Organism Genomes

ID U-4622

Category Research Tools (Non-Tangible Property)

Subcategory Software

Researchers
Brief Summary

MAKER is a portable and easily configurable genome annotation pipeline. Its purpose is to allow investigators to independently annotate eukaryotic genomes and create genome databases.

Problem Statement

One of the great ironies of the DNA sequencing revolution is that genome annotation, not genome sequencing, has become the bottleneck in genomics today. New genomes are being sequenced at a far faster rate than they are being annotated. Tools to manage and analyze gene annotations are in great demand as more genomes are sequenced and published. Annotation management and comparison at the whole-genome scale is often restricted to analyses of basic traits. Although simple statistics are useful, they don’t move beyond the amino acid sequences to take into account other aspects of the annotations such as similarities in intron-exon structures and patterns of alternative splicing.

Technology Description

The Yandell lab has been working on an easy-to-use genome annotation pipeline called MAKER (yandell.topaz.genetics.utah.edu/cgi-bin/maker_license.cgi). The goal is to provide teams without extensive bioinformatics expertise the means to independently annotate genomes, with the option to distribute the results to the larger biomedical community. MAKER introduces new measures to evaluate changes to annotations across releases, such as Annotation Turnover, and Annotation Edit Distance. The other new measure that moves beyond state of the art quantifies the complexity of alternative splicing, called Splice Complexity. It offers two classes of analyses intra-genome comparisons of annotations that track and summarize genome-wide changes in annotations from release to release, and inter-genome comparisons that compare and contrast the annotations of different genomes to another. Practical issues such as completeness and usability are also taken into account.

Stage of Development

Generating Revenue

Benefit

  • MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions, and automatically synthesizes these data into gene annotations having evidence-based quality indices.
  • MAKER is easily trainable.
  • The outputs of preliminary runs are used to automatically retrain its gene-prediction algorithm, producing higher-quality gene-models on subsequent runs.
  • The inputs for MAKER are minimal, and its outputs can be used to create a GMOD database.
  • The outputs can be viewed in the Apollo Genome browser, which gives it an easy means to annotate, view, and edit individual contigs and BACs without the overhead of a database.

Publications

Eilbeck, K et al. “Quantitative measures for the management and comparison of annotated genomes.” BMC bioinformatics vol. 10 67. 23 Feb. 2009, doi:10.1186/1471-2105-10-67

Cantarel, BL et al. “MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes.” Genome res vol. 18,1 (2008): 188-96. doi:10.1101/gr.6743907

Campbell, MS et al. “Genome Annotation and Curation Using MAKER and MAKER-P.” Curr protocols in bioinformatics vol. 48 4.11.1-39. 12 Dec. 2014, doi:10.1002/0471250953.bi0411s48

Contact Info

Aaron Duffy
(801) 585-1377
aaron.duffy@utah.edu

Questions?

We support you and your innovation.

Wherever you are on your innovation journey, the Technology Licensing Office is your go-to source to connect you with the U’s innovation ecosystem.

Call 801.581.7792 or send us a message