De novo Transcript Sequence Reconstruction from RNA-seq using the Trinity Platform for Reference Generation and Analysis

Jul 11, 2013·
B. haas
,
A. papanicolaou
,
M. yassour
,
M. grabherr
,
P. d. blood
,
J. bowden
,
M. b. couger
,
D. eccles
Dr. Bo Li
Dr. Bo Li
,
M. lieber
,
M. d. macmanes
,
M. ott
,
J. orvis
,
N. pochet
,
F. strozzi
,
N. weeks
,
R. westerman
,
T. william
,
C. n. dewey
,
R. henschel
,
R. d. leduc
,
N. friedman
,
A. regev
· 0 min read
Abstract
De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on ’non-model organisms’ of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.
Type
Publication
Nature Protocols
Authors
Authors
Authors
Authors
Dr. Bo Li
Authors
Principal Scientist II
Dr. Bo Li is a Principal Scientist at Genentech, Inc. His research focuses on large-scale single-cell genomics data analysis. Before joining in Genentech, he was an Assistant Professor of Medicine at Harvard Medical School and the director of Bioinformatics and Computational Biology at Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital. He received his Ph.D. in computer science from UW-Madison and completed two postdoctoral trainings with Dr. Lior Pachter at UC Berkeley and Dr. Aviv Regev at Broad Institute. He is best known for developing RSEM, an impactful RNA-seq transcript quantification software. RSEM is cited 22,602 times (Google Scholar) and adopted by several big consortia such as TCGA, ENCODE, GTEx and TOPMed.
Authors
Authors
Authors
Authors
Authors
Authors
Authors
Authors