introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
33 rows where transcript_id = 32765304
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 183237438 | GT-AG | 0 | 1.000000099473604e-05 | 1087 | rna-XM_004953324.4 32765304 | 1 | 33752365 | 33753451 | Setaria italica 4555 | TCG|GTGAGTGTCC...TAGGCTTTGATT/TAGGCTTTGATT...GACAG|ATT | 1 | 1 | 8.826 |
| 183237439 | GT-AG | 0 | 0.0005767253806858 | 117 | rna-XM_004953324.4 32765304 | 2 | 33753531 | 33753647 | Setaria italica 4555 | TTG|GTACTATTGA...CTTCTCTTGAAC/GATGTGTTCATT...TGCAG|CAC | 2 | 1 | 10.774 |
| 183237440 | GT-AG | 0 | 1.000000099473604e-05 | 341 | rna-XM_004953324.4 32765304 | 3 | 33753722 | 33754062 | Setaria italica 4555 | CAG|GTTGGTCAAA...TCTTCCCTGATA/TCTTCCCTGATA...CCCAG|GTG | 1 | 1 | 12.599 |
| 183237441 | GT-AG | 0 | 0.0021494549284057 | 85 | rna-XM_004953324.4 32765304 | 4 | 33754173 | 33754257 | Setaria italica 4555 | AAG|GTAAACTTAT...GAGATCTTAACA/TTTATATTGAAG...TTCAG|AAA | 0 | 1 | 15.311 |
| 183237442 | GT-AG | 0 | 5.687250166754806e-05 | 164 | rna-XM_004953324.4 32765304 | 5 | 33754339 | 33754502 | Setaria italica 4555 | AAG|GTTCACGTCC...TCACCTTTGATG/GAAGTTCTCACC...TGCAG|AAA | 0 | 1 | 17.308 |
| 183237443 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_004953324.4 32765304 | 6 | 33754590 | 33754665 | Setaria italica 4555 | GAA|GTAAGAAAAA...ACTTTTCTAGCT/AGCTGGCTTATA...TGTAG|GGT | 0 | 1 | 19.453 |
| 183237444 | GT-AG | 0 | 4.723974529369308e-05 | 745 | rna-XM_004953324.4 32765304 | 7 | 33754809 | 33755553 | Setaria italica 4555 | CAG|GTATGGTTAA...AATGTCTGATCT/ATGTTTCTGATA...TGCAG|GCT | 2 | 1 | 22.978 |
| 183237445 | GT-AG | 0 | 0.000966908207029 | 78 | rna-XM_004953324.4 32765304 | 8 | 33755705 | 33755782 | Setaria italica 4555 | GAT|GTACGTTACT...GTTGTGTTGATA/GTTGTGTTGATA...CACAG|GAA | 0 | 1 | 26.701 |
| 183237446 | GT-AG | 0 | 1.000000099473604e-05 | 208 | rna-XM_004953324.4 32765304 | 9 | 33755906 | 33756113 | Setaria italica 4555 | GAG|GTAATAATAC...GATTCCTTAAAT/TAAATTCTGACA...TCCAG|CAC | 0 | 1 | 29.734 |
| 183237447 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_004953324.4 32765304 | 10 | 33756198 | 33756308 | Setaria italica 4555 | GAG|GTGAGATCCT...TCTATCTTAGAA/TTCTATCTTAGA...CACAG|GTT | 0 | 1 | 31.805 |
| 183237448 | GT-AG | 0 | 0.0045360180770475 | 403 | rna-XM_004953324.4 32765304 | 11 | 33756497 | 33756899 | Setaria italica 4555 | CTG|GTATGTTGTT...CTGTTTTTAGAC/CCTGTTTTTAGA...TTAAG|GTC | 2 | 1 | 36.44 |
| 183237449 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_004953324.4 32765304 | 12 | 33757075 | 33757185 | Setaria italica 4555 | AAG|GTTTGTGTGG...CTCTTCTAATTT/GCTCTTCTAATT...TTCAG|GCC | 0 | 1 | 40.754 |
| 183237450 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_004953324.4 32765304 | 13 | 33757308 | 33757410 | Setaria italica 4555 | TAG|GTTAGTTGCT...ATAGTGTTGATT/ATAGTGTTGATT...GGCAG|GGC | 2 | 1 | 43.762 |
| 183237451 | GT-AG | 0 | 1.000000099473604e-05 | 380 | rna-XM_004953324.4 32765304 | 14 | 33757491 | 33757870 | Setaria italica 4555 | CAA|GTAAGTGCCC...GATAACATGACA/TGATATATTATG...TGCAG|TTC | 1 | 1 | 45.735 |
| 183237452 | GT-AG | 0 | 1.000000099473604e-05 | 294 | rna-XM_004953324.4 32765304 | 15 | 33757930 | 33758223 | Setaria italica 4555 | GAG|GTCAGTTCCT...ATATTCATAACT/ATTATATTCATA...TTCAG|TGG | 0 | 1 | 47.189 |
| 183237453 | GT-AG | 0 | 1.000000099473604e-05 | 435 | rna-XM_004953324.4 32765304 | 16 | 33758379 | 33758813 | Setaria italica 4555 | CAA|GTTGGTACCT...AACTTCTTAATT/AACTTCTTAATT...TGCAG|CAT | 2 | 1 | 51.011 |
| 183237454 | GT-AG | 0 | 0.0006977046012561 | 315 | rna-XM_004953324.4 32765304 | 17 | 33758993 | 33759307 | Setaria italica 4555 | CTG|GTACTCATTT...TCTATTTTGGTT/ATTTTGGTTACC...TACAG|GTC | 1 | 1 | 55.424 |
| 183237455 | GT-AG | 0 | 0.0412535483103558 | 68 | rna-XM_004953324.4 32765304 | 18 | 33759421 | 33759488 | Setaria italica 4555 | CAG|GTATACCACA...TTTATCTTAGTT/TTTTATCTTAGT...GCCAG|ATC | 0 | 1 | 58.21 |
| 183237456 | GT-AG | 0 | 0.0001285217182658 | 144 | rna-XM_004953324.4 32765304 | 19 | 33759660 | 33759803 | Setaria italica 4555 | GAG|GTACTTCTAT...CACTCGTTAATA/ATTTATTTCAAA...TGCAG|ATT | 0 | 1 | 62.426 |
| 183237457 | GT-AG | 0 | 1.000000099473604e-05 | 73 | rna-XM_004953324.4 32765304 | 20 | 33759933 | 33760005 | Setaria italica 4555 | AAG|GTAAGTCGAG...TCAGTTTTATTT/CATATATTGATT...TGCAG|ATC | 0 | 1 | 65.607 |
| 183237458 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_004953324.4 32765304 | 21 | 33760107 | 33760191 | Setaria italica 4555 | AAC|GTGAGTTGAT...TTTTTTTTAACT/TTTTTTTTAACT...TGCAG|TTA | 2 | 1 | 68.097 |
| 183237459 | GT-AG | 0 | 1.000000099473604e-05 | 368 | rna-XM_004953324.4 32765304 | 22 | 33760310 | 33760677 | Setaria italica 4555 | AAG|GTGCATTTAT...GTTCTCTTTGCA/GAAAAATTGACA...AGCAG|CGT | 0 | 1 | 71.006 |
| 183237460 | GT-AG | 0 | 1.000000099473604e-05 | 1794 | rna-XM_004953324.4 32765304 | 23 | 33760758 | 33762551 | Setaria italica 4555 | AAG|GTAAATCTCT...TATTTATTAATA/TATTTATTAATA...TGTAG|GCA | 2 | 1 | 72.978 |
| 183237461 | GT-AG | 0 | 0.4151533928288182 | 90 | rna-XM_004953324.4 32765304 | 24 | 33762625 | 33762714 | Setaria italica 4555 | AAG|GTATCCCATC...AATGATTTAGCA/GATGTTATGACC...CACAG|GAT | 0 | 1 | 74.778 |
| 183237462 | GT-AG | 0 | 0.0056822697920714 | 108 | rna-XM_004953324.4 32765304 | 25 | 33762801 | 33762908 | Setaria italica 4555 | AGG|GTATTTCCTA...ATTTCTTTACAT/CATTTCTTTACA...TTCAG|AGA | 2 | 1 | 76.898 |
| 183237463 | GT-AG | 0 | 0.0016593045403987 | 91 | rna-XM_004953324.4 32765304 | 26 | 33762961 | 33763051 | Setaria italica 4555 | AAG|GTATATATCC...TATCTGTTGATT/TATCTGTTGATT...TGCAG|AAT | 0 | 1 | 78.18 |
| 183237464 | GT-AG | 0 | 1.000000099473604e-05 | 197 | rna-XM_004953324.4 32765304 | 27 | 33763184 | 33763380 | Setaria italica 4555 | AAG|GTAACACCAA...CATACTTTCATT/CTTTCATTCATT...CGCAG|GTT | 0 | 1 | 81.435 |
| 183237465 | GT-AG | 0 | 1.000000099473604e-05 | 221 | rna-XM_004953324.4 32765304 | 28 | 33763450 | 33763670 | Setaria italica 4555 | GAG|GTTGGTCCGC...CAATCCTTACTA/ACAATCCTTACT...ACTAG|GTT | 0 | 1 | 83.136 |
| 183237466 | GT-AG | 0 | 3.746911864720706e-05 | 474 | rna-XM_004953324.4 32765304 | 29 | 33763764 | 33764237 | Setaria italica 4555 | AAG|GTAGGCGTTG...CCTTTCTTGGCA/ACTAAGTTAACC...TGCAG|TCA | 0 | 1 | 85.429 |
| 183237467 | GT-AG | 0 | 0.0003610496862759 | 496 | rna-XM_004953324.4 32765304 | 30 | 33764331 | 33764826 | Setaria italica 4555 | GAG|GTAACTACTA...CTTTTTTTCACA/CTTTTTTTCACA...TTCAG|GTT | 0 | 1 | 87.722 |
| 183237468 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_004953324.4 32765304 | 31 | 33764926 | 33765021 | Setaria italica 4555 | CAG|GTAATTGGAT...ACTACCTTAAAT/AAATATTTTACT...GATAG|AAA | 0 | 1 | 90.163 |
| 183237469 | GT-AG | 0 | 1.000000099473604e-05 | 349 | rna-XM_004953324.4 32765304 | 32 | 33765115 | 33765463 | Setaria italica 4555 | AAG|GTGCAGAATC...TTTGTCATAATT/ATTTTTGTCATA...GACAG|TCT | 0 | 1 | 92.456 |
| 183237470 | GT-AG | 0 | 0.0060582771883959 | 949 | rna-XM_004953324.4 32765304 | 33 | 33765614 | 33766562 | Setaria italica 4555 | AAG|GTATATTCTG...GTTTTCATGATG/TTTGTGCTAATA...TCCAG|GTT | 0 | 1 | 96.154 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);