introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 32765283
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 183237071 | GT-AG | 0 | 3.543501303468696e-05 | 735 | rna-XM_004952851.3 32765283 | 1 | 29005330 | 29006064 | Setaria italica 4555 | GAG|GTAAGCTTCG...AGATGATTAACC/AGATGATTAACC...TGCAG|GTA | 0 | 1 | 7.113 |
| 183237072 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_004952851.3 32765283 | 2 | 29006259 | 29006341 | Setaria italica 4555 | GAG|GTGAGGTCCC...CACATCCTAACA/TTAGGGCTTATT...TGCAG|ATA | 2 | 1 | 10.915 |
| 183237073 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-XM_004952851.3 32765283 | 3 | 29006460 | 29007252 | Setaria italica 4555 | CAG|GTAATAACCT...TAGTTCTCAGTA/GTAGTTCTCAGT...TACAG|GTT | 0 | 1 | 13.228 |
| 183237074 | GT-AG | 0 | 0.0008547451559157 | 255 | rna-XM_004952851.3 32765283 | 4 | 29007370 | 29007624 | Setaria italica 4555 | CAG|GTATGCGGAA...TGAATTTTAATA/TAGTTGCTTATT...AACAG|ATA | 0 | 1 | 15.52 |
| 183237075 | GT-AG | 0 | 1.4143137174397727e-05 | 125 | rna-XM_004952851.3 32765283 | 5 | 29007730 | 29007854 | Setaria italica 4555 | CTG|GTAATATTTC...TTGTTATTAATT/TTGTTATTAATT...ACCAG|GTG | 0 | 1 | 17.578 |
| 183237076 | GT-AG | 0 | 0.0049518376418936 | 2086 | rna-XM_004952851.3 32765283 | 6 | 29007912 | 29009997 | Setaria italica 4555 | AAG|GTATTCCCTT...ATATCCCTGATC/CTGGCACTTATT...TGCAG|GAG | 0 | 1 | 18.695 |
| 183237077 | GT-AG | 0 | 0.0001403510995159 | 253 | rna-XM_004952851.3 32765283 | 7 | 29010067 | 29010319 | Setaria italica 4555 | ATG|GTATGATTCT...CTAATATTATCT/TCTTGTGTCATT...TATAG|GAC | 0 | 1 | 20.047 |
| 183237078 | GT-AG | 0 | 0.0010854169495868 | 63 | rna-XM_004952851.3 32765283 | 8 | 29010428 | 29010490 | Setaria italica 4555 | ACG|GTAAGCTTAA...TTGCCTTTAGCA/ATGCTTCTGATC...TTTAG|TAC | 0 | 1 | 22.163 |
| 183237079 | GT-AG | 0 | 1.000000099473604e-05 | 214 | rna-XM_004952851.3 32765283 | 9 | 29010686 | 29010899 | Setaria italica 4555 | TCA|GTGAGTGGCA...GGAGTCTGGACT/TCAAGTTTCACT...TGTAG|GTG | 0 | 1 | 25.985 |
| 183237080 | GT-AG | 0 | 1.000000099473604e-05 | 368 | rna-XM_004952851.3 32765283 | 10 | 29010999 | 29011366 | Setaria italica 4555 | CAG|GTACTGGTTA...CCCATCTCAACT/ACTGTTTTCATT...TTCAG|GTC | 0 | 1 | 27.925 |
| 183237081 | GT-AG | 0 | 0.0003894118439196 | 130 | rna-XM_004952851.3 32765283 | 11 | 29011934 | 29012063 | Setaria italica 4555 | CAG|GTTTTTCTCC...ATTATTTTAACT/ATTATTTTAACT...GGCAG|ATT | 0 | 1 | 39.036 |
| 183237082 | GT-AG | 0 | 1.000000099473604e-05 | 417 | rna-XM_004952851.3 32765283 | 12 | 29012289 | 29012705 | Setaria italica 4555 | GTG|GTATGAGCTG...TTTGTTCTGACA/TTTGTTCTGACA...TCCAG|GTT | 0 | 1 | 43.445 |
| 183237083 | GT-AG | 0 | 0.0002932495343589 | 430 | rna-XM_004952851.3 32765283 | 13 | 29013009 | 29013438 | Setaria italica 4555 | CAT|GTAAGTTGTT...ACAACTTTGACT/ACTGATCTCACA...GTCAG|GTA | 0 | 1 | 49.383 |
| 183237084 | GT-AG | 0 | 1.000000099473604e-05 | 194 | rna-XM_004952851.3 32765283 | 14 | 29013507 | 29013700 | Setaria italica 4555 | AAG|GTTTGAAGTT...ATGCTGTTGACA/TGTATATTCATT...TGAAG|GGC | 2 | 1 | 50.715 |
| 183237085 | GT-AG | 0 | 2.4548456776208914e-05 | 284 | rna-XM_004952851.3 32765283 | 15 | 29013860 | 29014143 | Setaria italica 4555 | CAG|GTAACATCAA...TTGTTCTTCTCA/GTTCTTCTCATT...TGCAG|GGT | 2 | 1 | 53.831 |
| 183237086 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_004952851.3 32765283 | 16 | 29014270 | 29014356 | Setaria italica 4555 | TAG|GTTCTTATCT...CTTTTTATAATT/CTTTTTATAATT...TGCAG|AAT | 2 | 1 | 56.3 |
| 183237087 | GT-AG | 0 | 0.0015686368193735 | 88 | rna-XM_004952851.3 32765283 | 17 | 29014489 | 29014576 | Setaria italica 4555 | CAG|GTATATGTGA...TTGCCTTTGAAA/TGCATTTTCAAC...AACAG|AAT | 2 | 1 | 58.887 |
| 183237088 | GT-AG | 0 | 6.901477076655109e-05 | 734 | rna-XM_004952851.3 32765283 | 18 | 29014686 | 29015419 | Setaria italica 4555 | GAG|GTTTGTCTCA...TTAGTTTTGATC/TTAGTTTTGATC...AACAG|GTT | 0 | 1 | 61.023 |
| 183237089 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_004952851.3 32765283 | 19 | 29015573 | 29015648 | Setaria italica 4555 | CCT|GTAAGAGAAC...GACTTCTCAACA/TTAGCACTGACT...CTCAG|CCT | 0 | 1 | 64.021 |
| 183237090 | GT-AG | 0 | 0.0025860733536493 | 111 | rna-XM_004952851.3 32765283 | 20 | 29015892 | 29016002 | Setaria italica 4555 | AAG|GTAACTATTC...TTTTCCTTTTCA/TTTTTGCTAATC...GCAAG|GTT | 0 | 1 | 68.783 |
| 183237091 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_004952851.3 32765283 | 21 | 29016171 | 29016357 | Setaria italica 4555 | GAG|GTAATACACC...CGTTTTTTGACT/CGTTTTTTGACT...TACAG|GAA | 0 | 1 | 72.075 |
| 183237092 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_004952851.3 32765283 | 22 | 29016502 | 29016589 | Setaria italica 4555 | CTT|GTGAGTACTG...CTTGCTTTAATT/CTTGCTTTAATT...CTCAG|GTT | 0 | 1 | 74.897 |
| 183237093 | GT-AG | 0 | 0.000431752603404 | 224 | rna-XM_004952851.3 32765283 | 23 | 29017052 | 29017275 | Setaria italica 4555 | AAT|GTAAGCATTC...TGTACATTAACT/TGTACATTAACT...TGCAG|GAA | 0 | 1 | 83.951 |
| 183237094 | GT-AG | 0 | 0.0083063232447675 | 75 | rna-XM_004952851.3 32765283 | 24 | 29017573 | 29017647 | Setaria italica 4555 | CAG|GTAACTTTGT...TGATTCTAAGCT/CTTATGCTGATT...GACAG|ATT | 0 | 1 | 89.771 |
| 183237095 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-XM_004952851.3 32765283 | 25 | 29017806 | 29017919 | Setaria italica 4555 | CAG|GTAATAGCCT...TTTTCCTTTATG/TTTTCCTTTATG...GCCAG|TCT | 2 | 1 | 92.867 |
| 183237096 | GT-AG | 0 | 1.000000099473604e-05 | 219 | rna-XM_004952851.3 32765283 | 26 | 29018028 | 29018246 | Setaria italica 4555 | AAA|GTAAGATCAC...GTTTTTTTAAAG/CAGAATCTCATG...ACCAG|GTC | 2 | 1 | 94.983 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);