Welcome to PolyAsite
Repository for 3' end sequencing data
We are planning an update of our Atlas for 2019!
Let us know if you have new 3' end sequencing data
you think we should incorporate, or if there are any
new features you'd like to see here.
-
2P-Seq
-
In the 2P-Seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 20 As preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a custom primer requiring to reverse complement reads to pinpoint the 3' end.
Publications & data sets
- Almada, A.E., et al. Promoter directionality is controlled by U1 snRNP and polyadenylation signals. Nature 499.360-3 (2013).
- Spies, N., Burge, C.B., Bartel, D.P. 3' UTR-isoform choice has limited influence on the stability and translational efficiency of most mRNAs in mouse fibroblasts. Genome Res 23, 2078-90 (2013).
-
In the 2P-Seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 20 As preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a custom primer requiring to reverse complement reads to pinpoint the 3' end.
-
3'-Seq (Mayr)
-
In the 3'-Seq protocol by Mayr and colleagues, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 17 As preceding the 3' adapter. Libraries are sequenced in sense direction requiring removal of the 3' adapter sequence and preceding As to pinpoint the 3' end.
Publications & data sets
-
In the 3'-Seq protocol by Mayr and colleagues, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 17 As preceding the 3' adapter. Libraries are sequenced in sense direction requiring removal of the 3' adapter sequence and preceding As to pinpoint the 3' end.
-
3'READS
-
3' region extraction and deep sequencing (3'READS) is a protocol that utilizes a special oligo(dT) primer ( 45 thymidines followed by 5 uridines) to caputre poly(A) containing RNA fragments. After partial digestion of the poly(A) tail, the fragments are subjected to adapter ligation, reverse transcription, and PCR amplification before they are sequenced in anti-sense direction. The cleavage site is inferred as the first non-A of the 3' end of the read's reverse complement.
Publications & data sets
- Li, W. et al. Systematic Profiling of poly(A)+ Transcripts Modulated by Core 3’ End Processing and Splicing Factors Reveals Regulatory Rules of Alternative Cleavage and Polyadenylation. PLoS Genetics 11 (4), e1005166 (2015).
- GSM1586364
- GSM1518114
- GSM1586368
- GSM1518100
- GSM1586363
- GSM1518071
- GSM1518072
- GSM1518073
- GSM1518074
- GSM1518075
- GSM1518076
- GSM1518077
- GSM1518078
- GSM1518079
- GSM1518080
- GSM1518081
- GSM1518082
- GSM1518083
- GSM1518084
- GSM1518086
- GSM1518085
- GSM1518087
- GSM1518089
- GSM1518088
- GSM1518091
- GSM1518090
- GSM1518092
- GSM1518093
- GSM1518094
- GSM1518095
- GSM1518096
- GSM1518097
- GSM1518098
- GSM1518101
- GSM1518099
- GSM1518102
- GSM1518103
- GSM1518104
- GSM1518105
- GSM1518106
- GSM1518108
- GSM1518107
- GSM1518109
- GSM1518110
- GSM1518111
- GSM1518112
- GSM1518113
- GSM1586365
- GSM1586366
- GSM1586367
-
3' region extraction and deep sequencing (3'READS) is a protocol that utilizes a special oligo(dT) primer ( 45 thymidines followed by 5 uridines) to caputre poly(A) containing RNA fragments. After partial digestion of the poly(A) tail, the fragments are subjected to adapter ligation, reverse transcription, and PCR amplification before they are sequenced in anti-sense direction. The cleavage site is inferred as the first non-A of the 3' end of the read's reverse complement.
-
3P-Seq
-
In the 3P-Seq protocol, a biotinylated adapter is ligated to the end of the poly(A) tail via splint-ligation and the poly(A) region is reverse transcribed with only dTTP. After removal of most of the poly(A) tail, adapters are ligated prior to reverse transcription and PCR amplification. Finally, the libraries are sequenced in anti-sense direction and the resulting reads are reverse complemented in order to pinpoint the 3' ends.
Publications & data sets
-
In the 3P-Seq protocol, a biotinylated adapter is ligated to the end of the poly(A) tail via splint-ligation and the poly(A) region is reverse transcribed with only dTTP. After removal of most of the poly(A) tail, adapters are ligated prior to reverse transcription and PCR amplification. Finally, the libraries are sequenced in anti-sense direction and the resulting reads are reverse complemented in order to pinpoint the 3' ends.
-
A-seq
-
In the A-seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have six As preceding the 3' adapter. Libraries are sequenced in sense direction requiring removal of the 3' adapter sequence and preceding As to pinpoint the 3' end.
Publications & data sets
- Martin, G., Gruber, A. R., Keller, W. & Zavolan, M. Genome-wide analysis of pre-mRNA 3’ end processing reveals a decisive role of human cleavage factor I in the regulation of 3' UTR length. Cell Rep 1, 753–763 (2012).
- Gruber, AR. et al. Global 3’ UTR Shortening Has a Limited Effect on Protein Abundance in Proliferating T Cells. Nat Commun 5, 5465 (2014).
- Gruber, A. R., Martin, G., Keller, W. & Zavolan, M. Cleavage factor Im is a key regulator of 3’ UTR length. RNA Biol 9, 1405–1412 (2012).
-
In the A-seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have six As preceding the 3' adapter. Libraries are sequenced in sense direction requiring removal of the 3' adapter sequence and preceding As to pinpoint the 3' end.
-
DRS
-
In the direct RNA sequencing (DRS) protocol, 3' ends of transcripts are hybridized to poly(dT)-coated flow cell surfaces where antisense strand synthesis is initiated. This has the advantage that no prior reverse transcription or cDNA amplification is needed.
Publications & data sets
- Rehfeld, A. et al. Alternative polyadenylation of tumor suppressor genes in small intestinal neuroendocrine tumors. Front Endocrinol (Lausanne) 5, 46 (2014).
- Yao, C. et al. Transcriptome-wide analyses of CstF64-RNA interactions in global regulation of mRNA alternative polyadenylation. Proc Natl Acad Sci U S A 109, 18773–18778 (2012).
- Ji, X. et al. αCP Poly(C) Binding Proteins Act as Global Regulators of Alternative Polyadenylation. Mol Cell Biol 33, 2560–2573 (2013).
- Lackford, B. et al. Fip1 regulates mRNA alternative polyadenylation to promote stem cell self-renewal. EMBO J (2014).
-
In the direct RNA sequencing (DRS) protocol, 3' ends of transcripts are hybridized to poly(dT)-coated flow cell surfaces where antisense strand synthesis is initiated. This has the advantage that no prior reverse transcription or cDNA amplification is needed.
-
PAS-Seq
-
In the PAS-Seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 20 As preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a costum primer requiring to reverse complement reads to pinpoint the 3' end.
Publications & data sets
-
In the PAS-Seq protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have 20 As preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a costum primer requiring to reverse complement reads to pinpoint the 3' end.
-
PolyA-seq
-
Library preparation for the PolyA-seq protocol requires (1) Reverse transcription, primed with an oligo(dT) sequence succeeding an universal sequence used as PCR anchor, (2) second strand synthesis with random hexamers linked to a second PCR primer, and (3) PCR amplification. Sequencing is accomplished in anti-sense orientation with a primer ending in 10 Ts; the resulting reads need to be reverse complemented in order to pinpoint the mRNA cleavage site.
Publications & data sets
-
Library preparation for the PolyA-seq protocol requires (1) Reverse transcription, primed with an oligo(dT) sequence succeeding an universal sequence used as PCR anchor, (2) second strand synthesis with random hexamers linked to a second PCR primer, and (3) PCR amplification. Sequencing is accomplished in anti-sense orientation with a primer ending in 10 Ts; the resulting reads need to be reverse complemented in order to pinpoint the mRNA cleavage site.
-
SAPAS
-
In the SAPAS protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have the sequence AAAAAAGAAAAAAGAAAAAA preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a regular primer requiring to trimm 20 nucleotides from the 5' end of reads and to reverse complement reads to pinpoint the 3' end.
Publications & data sets
- You, L. et al. APASdb: A Database Describing Alternative poly(A) Sites and Selection of Heterogeneous Cleavage Sites Downstream of poly(A) Signals. Nucl Acids Res 43 (Database issue), D59–67 (2015).
- Fu, Y. et al. Differential genome-wide profiling of tandem 3’ UTRs among human breast cancer and normal cells by high-throughput sequencing. Genome Res 21, 741–747 (2011).
- SRX480169
- SRX480179
- SRX480205
- SRX480212
- SRX480221
- SRX480227
- SRX480229
- SRX480250
- SRX480287
- SRX026582
- SRX026583
- SRX026584
- SRX517313
- SRX517314
- SRX517315
- SRX517316
- SRX517317
- SRX517318
- SRX517319
- SRX517320
- SRX517321
- SRX517322
- SRX517323
- SRX517324
- SRX517325
- SRX517326
- SRX517327
- SRX517328
- SRX517329
- SRX517330
- SRX517331
- SRX517332
- SRX517333
- SRX517334
-
In the SAPAS protocol, reverse transcription is accomplished by an anchored oligo(dT) primer. The products of reverse transcription and PCR amplification are expected to have the sequence AAAAAAGAAAAAAGAAAAAA preceding the 3' adapter. Libraries are sequenced in anti-sense direction with a regular primer requiring to trimm 20 nucleotides from the 5' end of reads and to reverse complement reads to pinpoint the 3' end.
-
Currently, PolyASite hosts following poly(A) site annotations:
Homo sapiens - version: 1.0 beta (hg19)
(Date of release: 2015-05-01 00:00:00)
This poly(A) site annotation was built using a total of 305,026,442 reads from 56 3'-end sequencing libraries (click here to see) generated with following 3' end sequencing protocols: 3'-Seq (Mayr), DRS, SAPAS, 3P-Seq, PolyA-seq, PAS-Seq, A-seq.
Download our inferred 3' end clusters as a BED file
We follow the standard BED specification with 0-based coordinates. Additionally, we appended extra column(s). For more information click here.
BED file
Homo sapiens - version: r1.0 (hg19)
(Date of release: 2015-10-26 08:00:00)
This poly(A) site annotation was built using a total of 475,703,248 reads from 78 3'-end sequencing libraries (click here to see) generated with following 3' end sequencing protocols: 3'-Seq (Mayr), DRS, SAPAS, 3P-Seq, PolyA-seq, PAS-Seq, A-seq.
Download our inferred 3' end clusters as a BED file
We follow the standard BED specification with 0-based coordinates. Additionally, we appended extra column(s). For more information click here.
BED file
Additional version(s) of the atlas
An additional column outlines the tissues (or cell types) in which each poly(A) site was detected.
BED with tissue info
The poly(A) sites were annotated based on the protein-coding genes and lincRNAs contained in the UCSC Basic Table of GENCODE V19. The BED file below provides the same poly(A) sites but their genomic context is annotated based on the full UCSC comprehensive Table for GENCODE V19.
BED file (different poly(A) site annotation)
Mus musculus - version: 1.0 beta (mm10)
(Date of release: 2015-05-01 00:00:00)
This poly(A) site annotation was built using a total of 528,924,694 reads from 51 3'-end sequencing libraries (click here to see) generated with following 3' end sequencing protocols: DRS, 3P-Seq, 2P-Seq, PolyA-seq, PAS-Seq, A-seq.
Download our inferred 3' end clusters as a BED file
We follow the standard BED specification with 0-based coordinates. Additionally, we appended extra column(s). For more information click here.
BED file
Mus musculus - version: r1.0 (mm10)
(Date of release: 2015-10-26 08:00:00)
This poly(A) site annotation was built using a total of 723,637,848 reads from 110 3'-end sequencing libraries (click here to see) generated with following 3' end sequencing protocols: 3'READS, DRS, SAPAS, 3P-Seq, 2P-Seq, PolyA-seq, PAS-Seq, A-seq.
Download our inferred 3' end clusters as a BED file
We follow the standard BED specification with 0-based coordinates. Additionally, we appended extra column(s). For more information click here.
BED file
Additional version(s) of the atlas
An additional column outlines the tissues (or cell types) in which each poly(A) site was detected.
BED with tissue info
Organisms
Protocols
-
SAPAS | Human
SRX026582
Source MDA-MB-231
Treatment none
-
SAPAS | Human
SRX026583
Source MCF-10A
Treatment none
-
SAPAS | Human
SRX026584
Source MCF-7
Treatment none
-
SAPAS | Human
SRX517313
Source brain
Treatment none
-
SAPAS | Human
SRX517314
Source thyroid
Treatment none
-
SAPAS | Human
SRX517315
Source spleen
Treatment none
-
SAPAS | Human
SRX517316
Source stomach
Treatment none
-
SAPAS | Human
SRX517317
Source kidney
Treatment none
-
SAPAS | Human
SRX517318
Source cervix
Treatment none
-
SAPAS | Human
SRX517319
Source heart
Treatment none
-
SAPAS | Human
SRX517320
Source lymph node
Treatment none
-
SAPAS | Human
SRX517321
Source placenta
Treatment none
-
SAPAS | Human
SRX517322
Source lung
Treatment none
-
SAPAS | Human
SRX517323
Source uterus
Treatment none
-
SAPAS | Human
SRX517324
Source bladder
Treatment none
-
SAPAS | Human
SRX517325
Source breast
Treatment none
-
SAPAS | Human
SRX517326
Source prostate
Treatment none
-
SAPAS | Human
SRX517327
Source liver
Treatment none
-
SAPAS | Human
SRX517328
Source pancreas
Treatment none
-
SAPAS | Human
SRX517329
Source small intestine
Treatment none
-
SAPAS | Human
SRX517330
Source thymus
Treatment none
-
SAPAS | Human
SRX517331
Source adipose
Treatment none
-
SAPAS | Human
SRX517332
Source skeletal muscle
Treatment none
-
SAPAS | Human
SRX517333
Source ovary
Treatment none
-
SAPAS | Human
SRX517334
Source testis
Treatment none
-
PAS-Seq | Mouse
GSM624687
Source ES
Treatment NA
-
3'-Seq (Mayr) | Human
SRX351949
Source native B cells
Treatment NA
-
3'-Seq (Mayr) | Human
SRX351950
Source native B cells
Treatment NA
-
3'-Seq (Mayr) | Human
SRX351952
Source brain
Treatment NA
-
3'-Seq (Mayr) | Human
SRX351953
Source breast
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359328
Source embryonic stem cells (H9)
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359329
Source ovary
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359330
Source skeletal muscle
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359331
Source testis
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359332
Source MCF10A
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359333
Source MCF10A
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359334
Source MCF7
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359335
Source HeLa
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359336
Source HEK293
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359337
Source NTERA2
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359339
Source B-LCL cells
Treatment NA
-
3'-Seq (Mayr) | Human
SRX359340
Source MCF10A
Treatment expression of HRAS (G12V)
-
3'-Seq (Mayr) | Human
SRX359341
Source MCF10A
Treatment expression of HRAS (G12V)
-
3P-Seq | Mouse
GSM1268946
Source heart
Treatment none
-
3P-Seq | Mouse
GSM1268947
Source muscle
Treatment none
-
3P-Seq | Mouse
GSM1268948
Source liver
Treatment none
-
3P-Seq | Mouse
GSM1268949
Source lung
Treatment none
-
3P-Seq | Mouse
GSM1268950
Source wat
Treatment none
-
3P-Seq | Mouse
GSM1268951
Source kidney
Treatment none
-
3P-Seq | Mouse
GSM1268952
Source heart
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268953
Source muscle
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268954
Source liver
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268955
Source lung
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268956
Source wat
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268957
Source kidney
Treatment miR-22 knockout
-
3P-Seq | Mouse
GSM1268958
Source embryonic stem cells
Treatment none
-
PAS-Seq | Human
GSM624686
Source HeLa
Treatment none
-
A-seq | Mouse
GSM1327166
Source T cells
Treatment none
-
A-seq | Mouse
GSM1327167
Source T cells
Treatment activation
-
A-seq | Mouse
GSM1327168
Source T cells
Treatment none
-
A-seq | Mouse
GSM1327169
Source T cells
Treatment none
-
2P-Seq | Mouse
GSM1089085
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089086
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089087
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089088
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089089
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089090
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089091
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089092
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089093
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089094
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089095
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1089096
Source 3T3
Treatment N/A
-
2P-Seq | Mouse
GSM1130096
Source embryonic stem cells
Treatment none
-
2P-Seq | Mouse
GSM1130097
Source embryonic stem cells
Treatment none
-
2P-Seq | Mouse
GSM1130098
Source embryonic stem cells
Treatment siRNA control
-
2P-Seq | Mouse
GSM1130099
Source embryonic stem cells
Treatment siRNA control
-
2P-Seq | Mouse
GSM1130100
Source embryonic stem cells
Treatment siRNA U1
-
2P-Seq | Mouse
GSM1130101
Source embryonic stem cells
Treatment siRNA U1
-
PolyA-seq | Mouse
GSM747481
Source Brain
Treatment none
-
PolyA-seq | Mouse
GSM747482
Source Kidney
Treatment none
-
PolyA-seq | Mouse
GSM747483
Source Liver
Treatment none
-
PolyA-seq | Mouse
GSM747484
Source Muscle
Treatment none
-
PolyA-seq | Mouse
GSM747485
Source Testis
Treatment none
-
PolyA-seq | Mouse
GSM1480973
Source MEF
Treatment none
-
PolyA-seq | Mouse
GSM1480974
Source MEF
Treatment none
-
PolyA-seq | Mouse
GSM1480975
Source MEF
Treatment double KO Mbnl
-
PolyA-seq | Mouse
GSM1480976
Source MEF
Treatment double KO Mbnl
-
PolyA-seq | Mouse
GSM1480977
Source MEF
Treatment triple KO/KD Mbnl
-
PolyA-seq | Mouse
GSM1480978
Source MEF
Treatment triple KO/KD Mbnl
-
PolyA-seq | Mouse
GSM1480979
Source MEF
Treatment siRNA Control
-
PolyA-seq | Mouse
GSM1480980
Source MEF
Treatment siRNA Control
-
DRS | Mouse
SRX304983
Source embryonic stem cell line E14Tg2a
Treatment siRNA Fip1
-
DRS | Mouse
SRX304982
Source embryonic stem cell line E14Tg2a
Treatment control siRNA
-
3'READS | Mouse
GSM1586364
Source NA
Treatment cntrl
-
3'READS | Mouse
GSM1518114
Source NA
Treatment siCtrl8N
-
3'READS | Mouse
GSM1586368
Source NA
Treatment siCtrl8T
-
3'READS | Mouse
GSM1518100
Source NA
Treatment siCFI-68.8N
-
3'READS | Mouse
GSM1586363
Source NA
Treatment siCFI-68.8T
-
3'READS | Mouse
GSM1518071
Source NA
Treatment siCPSF-160.1
-
3'READS | Mouse
GSM1518072
Source NA
Treatment siCPSF-100.3
-
3'READS | Mouse
GSM1518073
Source NA
Treatment siCPSF-73.1
-
3'READS | Mouse
GSM1518074
Source NA
Treatment siCPSF-30.3
-
3'READS | Mouse
GSM1518075
Source NA
Treatment siFip1.2
-
3'READS | Mouse
GSM1518076
Source NA
Treatment siWDR33.4
-
3'READS | Mouse
GSM1518077
Source NA
Treatment siCstF-50.4
-
3'READS | Mouse
GSM1518078
Source NA
Treatment siCstF-64.1
-
3'READS | Mouse
GSM1518079
Source NA
Treatment siτCstF-64.4
-
3'READS | Mouse
GSM1518080
Source NA
Treatment siCstF-77.5
-
3'READS | Mouse
GSM1518081
Source NA
Treatment siCFI-25.1
-
3'READS | Mouse
GSM1518082
Source NA
Treatment siCFI-68.1
-
3'READS | Mouse
GSM1518083
Source NA
Treatment siCFI-59.1
-
3'READS | Mouse
GSM1518084
Source NA
Treatment siPcf11.4
-
3'READS | Mouse
GSM1518086
Source NA
Treatment siSymplekin.4
-
3'READS | Mouse
GSM1518085
Source NA
Treatment siClp1.4
-
3'READS | Mouse
GSM1518087
Source NA
Treatment siPAPα.6
-
3'READS | Mouse
GSM1518089
Source NA
Treatment siPABPN1.3
-
3'READS | Mouse
GSM1518088
Source NA
Treatment siPAPγ.6
-
3'READS | Mouse
GSM1518091
Source NA
Treatment siPP1α.3
-
3'READS | Mouse
GSM1518090
Source NA
Treatment siPABPC1.3
-
3'READS | Mouse
GSM1518092
Source NA
Treatment siPP1β.3
-
3'READS | Mouse
GSM1518093
Source NA
Treatment siRBBP6.4
-
3'READS | Mouse
GSM1518094
Source NA
Treatment siU2AF65.7
-
3'READS | Mouse
GSM1518095
Source NA
Treatment siSF3b155.7
-
3'READS | Mouse
GSM1518096
Source NA
Treatment siU1-70K.7
-
3'READS | Mouse
GSM1518097
Source NA
Treatment U1D_8h
-
3'READS | Mouse
GSM1518098
Source NA
Treatment U1D_24h
-
3'READS | Mouse
GSM1518101
Source NA
Treatment siFip1.8N
-
3'READS | Mouse
GSM1518099
Source NA
Treatment siRRP44_siRRP6.6
-
3'READS | Mouse
GSM1518102
Source NA
Treatment siPABPC1.8N
-
3'READS | Mouse
GSM1518103
Source NA
Treatment siPABPN1.8N
-
3'READS | Mouse
GSM1518104
Source NA
Treatment siPcf11.8N
-
3'READS | Mouse
GSM1518105
Source NA
Treatment siCtrl1
-
3'READS | Mouse
GSM1518106
Source NA
Treatment siCtrl2
-
3'READS | Mouse
GSM1518108
Source NA
Treatment siCtrl4
-
3'READS | Mouse
GSM1518107
Source NA
Treatment siCtrl3
-
3'READS | Mouse
GSM1518109
Source NA
Treatment siCtrl5
-
3'READS | Mouse
GSM1518110
Source NA
Treatment siCtrl6
-
3'READS | Mouse
GSM1518111
Source NA
Treatment siCtrl7
-
3'READS | Mouse
GSM1518112
Source NA
Treatment mU1D8h
-
3'READS | Mouse
GSM1518113
Source NA
Treatment mU1D24h
-
3'READS | Mouse
GSM1586365
Source NA
Treatment siPABPC1.8T
-
3'READS | Mouse
GSM1586366
Source NA
Treatment siPABPN1.8T
-
3'READS | Mouse
GSM1586367
Source NA
Treatment siPcf11.8T
-
PolyA-seq | Human
GSM747470
Source Brain
Treatment none
-
PolyA-seq | Human
GSM747471
Source Kidney
Treatment none
-
PolyA-seq | Human
GSM747472
Source Liver
Treatment none
-
PolyA-seq | Human
GSM747473
Source MAQC Brain
Treatment none
-
PolyA-seq | Human
GSM747474
Source MAQC Brain
Treatment none
-
PolyA-seq | Human
GSM747475
Source MAQC UHR
Treatment none
-
PolyA-seq | Human
GSM747476
Source MAQC UHR
Treatment none
-
PolyA-seq | Human
GSM747477
Source Muscle
Treatment none
-
PolyA-seq | Human
GSM747479
Source Testis
Treatment none
-
PolyA-seq | Human
GSM747480
Source UHR
Treatment none
-
3P-Seq | Human
GSM1268942
Source HeLa
Treatment none
-
3P-Seq | Human
GSM1268943
Source HEK293
Treatment none
-
3P-Seq | Human
GSM1268944
Source Huh7
Treatment none
-
3P-Seq | Human
GSM1268945
Source IMR90
Treatment none
-
SAPAS | Mouse
SRX480169
Source thymus
Treatment none
-
SAPAS | Mouse
SRX480179
Source thymus
Treatment SAPAS library of mouse thymus at 15.5 embryo day
-
SAPAS | Mouse
SRX480205
Source thymus
Treatment SAPAS library of mouse thymus at 18.5 embryo day
-
SAPAS | Mouse
SRX480212
Source thymus
Treatment SAPAS library of mouse thymus at 0 post-natal day
-
SAPAS | Mouse
SRX480221
Source thymus
Treatment SAPAS library of mouse thymus at 0 post-natal day replication 2
-
SAPAS | Mouse
SRX480227
Source thymus
Treatment SAPAS library of mouse thymus at 6 post-natal day
-
SAPAS | Mouse
SRX480229
Source thymus
Treatment SAPAS library of mouse thymus at 22 post-natal day
-
SAPAS | Mouse
SRX480250
Source thymus
Treatment SAPAS library of mouse thymus at 45 post-natal day
-
SAPAS | Mouse
SRX480287
Source thymus
Treatment SAPAS library of mouse thymus at 90 post-natal day
-
DRS | Human
SRX275752
Source K562
Treatment siRNA aCP
-
DRS | Human
SRX275753
Source K562
Treatment siRNA aCP
-
DRS | Human
SRX275806
Source K562
Treatment siRNA control
-
DRS | Human
SRX275827
Source K562
Treatment siRNA control
-
DRS | Human
GSM1003590
Source HeLa
Treatment siRNA control
-
DRS | Human
GSM1003591
Source HeLa
Treatment siRNA CstF64
-
DRS | Human
GSM1003592
Source HeLa
Treatment siRNA CstF64/CstF64t
-
DRS | Human
SRX388391
Source HeLa
Treatment siRNA Fip1
-
DRS | Human
GSM1366428
Source neuroendocrine tumor
Treatment none
-
DRS | Human
GSM1366429
Source neuroendocrine tumor
Treatment none
-
DRS | Human
GSM1366430
Source Pituitary
Treatment none
-
A-seq | Human
GSM909242
Source HEK293
Treatment none
-
A-seq | Human
GSM909243
Source HEK293
Treatment siRNA Ctrl
-
A-seq | Human
GSM909244
Source HEK293
Treatment siRNA CstF-64
-
A-seq | Human
GSM909245
Source HEK293
Treatment siRNA CF Im68
-
A-seq | Human
GSM986133
Source HEK293
Treatment none
-
A-seq | Human
GSM986134
Source HEK293
Treatment siRNA Control
-
A-seq | Human
GSM986135
Source HEK293
Treatment siRNA CF Im25
-
A-seq | Human
GSM986136
Source HEK293
Treatment siRNA CF Im59
-
A-seq | Human
GSM986137
Source HEK293
Treatment siRNA Ctrl
-
A-seq | Human
GSM986138
Source HEK293
Treatment siRNA CF Im68
Please wait...
PolyAsite takes action
Once established, we used the PolyAsite resource in our own research and developed two tools that exploit the comprehensive annotation of 3' ends.
PAQR
PAQR is a method for the polyadenylation site usage quantification from RNA sequencing data and infers drops in the read coverage profile of RNA-seq libraries at positions of annotated poly(A) sites. In this way, the relative poly(A) site usage can be calculated for genes whose 3' UTR length can vary depending on the chosen cleavage site.
KAPAC
KAPAC stands for k-mer activity on polyadenylation site cchoice and is the method that infers position-dependent activities of sequence motifs on 3' end processing from changes in poly(A) site usage between conditions.
Availability
KAPAC can be applied instantly to 3' end quantifications that were obtain from data sets outlined in the experiments tab. To also allow the application of KAPAC on RNA-seq data, PAQR was developed such that its output feeds perfectly into KAPAC. Both tools are available on https://github.com/zavolanlab/PAQR_KAPAC.
A full use case application of both tools through a snakemake pipeline is available on zenodo. The available archive contains all necessary input data, scripts and software packages to be started out of the box on a linux systems. It can be downloaded from https://doi.org/10.5281/zenodo.1147433
3' end quantification made easy
Using our poly(A) site cluster annotation, quantification of 3' end of transcripts is done in three easy steps.
1.You have sequencing data generated with a dedicated 3'-end sequencing protocol? Use any mapping software you like, for example segemehl or STAR, to create a mapping file in SAM (or its binary equivalent BAM) format.
2. Next, from your SAM or BAM file generate a BED file with samtools and bedtools. Have a look at this sample code:
3. Use our cluster annotation and a PERL script called PAS-quant.pl to get your data analyzed within seconds. The output is a TAB-separated file (TSV) that can easily be read with R or any spreadsheet software.
If you don't have any time to lose: a one-liner to go from your SAM file to 3' end quantification data:
1.You have sequencing data generated with a dedicated 3'-end sequencing protocol? Use any mapping software you like, for example segemehl or STAR, to create a mapping file in SAM (or its binary equivalent BAM) format.
2. Next, from your SAM or BAM file generate a BED file with samtools and bedtools. Have a look at this sample code:
# If you have a SAM file: samtools view -Sb myfile.sam | bamToBed > myfile.bed # If you have already have a BAM file: bamToBed -i myfile.bam > myfile.bed
3. Use our cluster annotation and a PERL script called PAS-quant.pl to get your data analyzed within seconds. The output is a TAB-separated file (TSV) that can easily be read with R or any spreadsheet software.
# Download the PERL script: curl http://polyasite.unibas.ch/download/scripts/PAS-quant.pl > PAS-quant.pl # Download the cluster annotation for your species of interest curl http://polyasite.unibas.ch/download/clusters/Mus_musculus/r1.0/clusters.bed > mouse.bed curl http://polyasite.unibas.ch/download/clusters/Homo_sapiens/r1.0/clusters.bed > human.bed # run the PERL script with your data perl PAS-quant.pl --clusters=human.bed --sample=myfile.bed > output.tsv
If you don't have any time to lose: a one-liner to go from your SAM file to 3' end quantification data:
samtools view -Sb myfile.sam | bamToBed | perl PAS-quant.pl --clusters=human.bed --pipe > output.tsv
Paper
If you use PolyASite in your research, please cite the following publication:
Gruber, A. J. et al. A comprehensive analysis of 3’ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation. Genome Res. 26, 1145–1159 (2016).
Gruber, A. J. et al. A comprehensive analysis of 3’ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation. Genome Res. 26, 1145–1159 (2016).
About PolyASite
PolyASite was built by Manuel Belmadani, Andreas R. Gruber,
Andreas J. Gruber, Ralf Schmidt, and Christina J. Herrmann. If you
have any suggestions on how PolyASite might improve - please let us know: christina.herrmann@unibas.ch