TY - JOUR
T1 - Transcript-targeted analysis reveals isoform alterations and double-hop fusions in breast cancer
AU - Namba, Shinichi
AU - Ueno, Toshihide
AU - Kojima, Shinya
AU - Kobayashi, Kenya
AU - Kawase, Katsushige
AU - Tanaka, Yosuke
AU - Inoue, Satoshi
AU - Kishigami, Fumishi
AU - Kawashima, Shusuke
AU - Maeda, Noriko
AU - Ogawa, Tomoko
AU - Hazama, Shoichi
AU - Togashi, Yosuke
AU - Ando, Mizuo
AU - Shiraishi, Yuichi
AU - Mano, Hiroyuki
AU - Kawazu, Masahito
N1 - Funding Information:
We thank Ms Miki Tamura, Ms Kaori Sugaya, Dr. Manabu Soda, and Dr. Yoshihiro Yamashita for technical assistance. We are grateful to all patients and families who contributed to this study. Computation time was provided by the Supercomputer System, Human Genome Center, the Institute of Medical Science, the University of Tokyo. This study was supported by grants from Japan Agency for Medical Research and Development (AMED) (JP17am0001001 to H.M.; JP15cm0106085 to S.H.; JP19cm0106502 to M.K.), a grant from the Japan Society for the Promotion of Science (JSPS) (16K07143 and 21H02772 to M.K.), and a grant from the UBE Industries Foundation (to M.K.).
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Although transcriptome alteration is an essential driver of carcinogenesis, the effects of chromosomal structural alterations on the cancer transcriptome are not yet fully understood. Short-read transcript sequencing has prevented researchers from directly exploring full-length transcripts, forcing them to focus on individual splice sites. Here, we develop a pipeline for Multi-Sample long-read Transcriptome Assembly (MuSTA), which enables construction of a transcriptome from long-read sequence data. Using the constructed transcriptome as a reference, we analyze RNA extracted from 22 clinical breast cancer specimens. We identify a comprehensive set of subtype-specific and differentially used isoforms, which extended our knowledge of isoform regulation to unannotated isoforms including a short form TNS3. We also find that the exon–intron structure of fusion transcripts depends on their genomic context, and we identify double-hop fusion transcripts that are transcribed from complex structural rearrangements. For example, a double-hop fusion results in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and is thought to protect fetus from maternal rejection; expression is elevated in several TCGA samples with ERVFRD-1 fusions. Our analyses provide direct evidence that full-length transcript sequencing of clinical samples can add to our understanding of cancer biology and genomics in general.
AB - Although transcriptome alteration is an essential driver of carcinogenesis, the effects of chromosomal structural alterations on the cancer transcriptome are not yet fully understood. Short-read transcript sequencing has prevented researchers from directly exploring full-length transcripts, forcing them to focus on individual splice sites. Here, we develop a pipeline for Multi-Sample long-read Transcriptome Assembly (MuSTA), which enables construction of a transcriptome from long-read sequence data. Using the constructed transcriptome as a reference, we analyze RNA extracted from 22 clinical breast cancer specimens. We identify a comprehensive set of subtype-specific and differentially used isoforms, which extended our knowledge of isoform regulation to unannotated isoforms including a short form TNS3. We also find that the exon–intron structure of fusion transcripts depends on their genomic context, and we identify double-hop fusion transcripts that are transcribed from complex structural rearrangements. For example, a double-hop fusion results in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and is thought to protect fetus from maternal rejection; expression is elevated in several TCGA samples with ERVFRD-1 fusions. Our analyses provide direct evidence that full-length transcript sequencing of clinical samples can add to our understanding of cancer biology and genomics in general.
UR - http://www.scopus.com/inward/record.url?scp=85119689065&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119689065&partnerID=8YFLogxK
U2 - 10.1038/s42003-021-02833-4
DO - 10.1038/s42003-021-02833-4
M3 - Article
C2 - 34811492
AN - SCOPUS:85119689065
SN - 2399-3642
VL - 4
JO - Communications Biology
JF - Communications Biology
IS - 1
M1 - 1320
ER -