introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
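In the rows listed for this transcript, `length` agrees with `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The Python sketch below checks that relation on three rows copied from the table. The helper name `intron_length` is hypothetical, and the inclusive-coordinate convention is an inference from these rows, not something the database states.

```python
# Hypothetical helper: assumes inclusive start/end coordinates,
# inferred from the listed rows rather than documented by the source.
def intron_length(start: int, end: int) -> int:
    return end - start + 1

# (id, start, end, length) copied from three rows where transcript_id = 1415786
rows = [
    (7708377, 27596, 27676, 81),
    (7708378, 27166, 27313, 148),
    (7708385, 23808, 24796, 989),
]

# Collect the ids of any rows whose stored length disagrees with the coordinates.
mismatches = [r[0] for r in rows if intron_length(r[1], r[2]) != r[3]]
print(mismatches)  # an empty list means every sampled row is consistent
```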
26 rows where transcript_id = 1415786
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7708377 | GT-AG | 0 | 1.91158172195362e-05 | 81 | rna-XM_019995238.1 1415786 | 1 | 27596 | 27676 | Amphimedon queenslandica 400682 | TAG\|GTAATTTAAT...GCCTTTTTGACT/TTTTGACTAATT...TATAG\|GCT | 1 | 1 | 0.765 |
| 7708378 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_019995238.1 1415786 | 2 | 27166 | 27313 | Amphimedon queenslandica 400682 | TAA\|GTGAGTCAAT...TAATCCTTAATC/CTTAATCTAACT...TATAG\|AAC | 1 | 1 | 8.47 |
| 7708379 | GT-AG | 0 | 3.3909207571587546e-05 | 285 | rna-XM_019995238.1 1415786 | 3 | 26590 | 26874 | Amphimedon queenslandica 400682 | TCA\|GTAAGTATGA...GTTTTGTTATTG/ATGTTGTTTATC...TTTAG\|GTA | 1 | 1 | 16.421 |
| 7708380 | GT-AG | 0 | 0.0001137679566117 | 212 | rna-XM_019995238.1 1415786 | 4 | 26300 | 26511 | Amphimedon queenslandica 400682 | AAG\|GTACATTAAT...ATTTATTTAATT/ATTTATTTAATT...TATAG\|ACA | 1 | 1 | 18.552 |
| 7708381 | GT-AG | 0 | 7.099754127043814e-05 | 388 | rna-XM_019995238.1 1415786 | 5 | 25852 | 26239 | Amphimedon queenslandica 400682 | AAG\|GTATGTTACA...CATACGGTAATA/AAAAAATTAATG...TATAG\|TTG | 1 | 1 | 20.191 |
| 7708382 | GT-AG | 0 | 0.0005254399612004 | 74 | rna-XM_019995238.1 1415786 | 6 | 25113 | 25186 | Amphimedon queenslandica 400682 | AAG\|GTACACATAC...TTGTTGTTAATA/TTGTTGTTAATA...TATAG\|GAA | 0 | 1 | 38.361 |
| 7708383 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_019995238.1 1415786 | 7 | 24980 | 25070 | Amphimedon queenslandica 400682 | GAG\|GTCAGAAAAT...TATTTGTTAATT/TATTTGTTAATT...TTCAG\|GTT | 0 | 1 | 39.508 |
| 7708384 | GT-AG | 0 | 0.0013024481320443 | 57 | rna-XM_019995238.1 1415786 | 8 | 24854 | 24910 | Amphimedon queenslandica 400682 | AAG\|GTATGTTATG...TTTCTCTTTGTA/ATGTTGATGATT...AATAG\|AAA | 0 | 1 | 41.393 |
| 7708385 | GT-AG | 0 | 1.000000099473604e-05 | 989 | rna-XM_019995238.1 1415786 | 9 | 23808 | 24796 | Amphimedon queenslandica 400682 | GAG\|GTTAGCTGGT...ATGTATTTGATT/ATGTATTTGATT...TTTAG\|GAA | 0 | 1 | 42.951 |
| 7708386 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-XM_019995238.1 1415786 | 10 | 23608 | 23656 | Amphimedon queenslandica 400682 | CAG\|GTTAGAGTAT...TAGTTTTTATTA/TTTTTATTAATT...TAAAG\|CTG | 1 | 1 | 47.077 |
| 7708387 | GT-AG | 0 | 0.0002799093169305 | 62 | rna-XM_019995238.1 1415786 | 11 | 23345 | 23406 | Amphimedon queenslandica 400682 | AAG\|GTTTTATTGA...TGATTCTTAATG/TTGATTCTTAAT...AATAG\|ATT | 1 | 1 | 52.568 |
| 7708388 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-XM_019995238.1 1415786 | 12 | 23227 | 23276 | Amphimedon queenslandica 400682 | AAA\|GTAAGGCAAA...TGTACCTTTTCT/ACCTTTTCTATT...TATAG\|TTC | 0 | 1 | 54.426 |
| 7708389 | GT-AG | 0 | 0.0006369541813398 | 52 | rna-XM_019995238.1 1415786 | 13 | 23070 | 23121 | Amphimedon queenslandica 400682 | AAG\|GTTTATTTTT...TGTTTCTCATCT/TTGTTTCTCATC...TTTAG\|TTA | 0 | 1 | 57.295 |
| 7708390 | GT-AG | 0 | 0.0124144364232898 | 45 | rna-XM_019995238.1 1415786 | 14 | 22926 | 22970 | Amphimedon queenslandica 400682 | GAA\|GTAAGCTTTA...GTTATCTTATTA/AGTTATCTTATT...CATAG\|TTT | 0 | 1 | 60.0 |
| 7708391 | GT-AG | 0 | 1.000000099473604e-05 | 77 | rna-XM_019995238.1 1415786 | 15 | 22743 | 22819 | Amphimedon queenslandica 400682 | GAG\|GTGGGTCACT...GTTGCCATGATA/TTGTGTTTGATG...TGTAG\|GCA | 1 | 1 | 62.896 |
| 7708392 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_019995238.1 1415786 | 16 | 22508 | 22626 | Amphimedon queenslandica 400682 | GAA\|GTAAGAGGAA...TCTCCCTTAAAC/GTCTCCCTTAAA...TATAG\|GTA | 0 | 1 | 66.066 |
| 7708393 | GT-AG | 0 | 1.000000099473604e-05 | 215 | rna-XM_019995238.1 1415786 | 17 | 22146 | 22360 | Amphimedon queenslandica 400682 | CAA\|GTACTAGCTA...CTCTCCTTTCTA/ACACAACTTATT...CCTAG\|TTT | 0 | 1 | 70.082 |
| 7708394 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna-XM_019995238.1 1415786 | 18 | 21890 | 21934 | Amphimedon queenslandica 400682 | GAA\|GTAAGAATTT...ATTACATTATTC/CATTACATTATT...TACAG\|AGG | 1 | 1 | 75.847 |
| 7708395 | GT-AG | 0 | 0.0049078223107847 | 125 | rna-XM_019995238.1 1415786 | 19 | 21592 | 21716 | Amphimedon queenslandica 400682 | GAG\|GTATAGTTTT...ATATTTTTATTT/TTATATTTCATT...CCTAG\|TTG | 0 | 1 | 80.574 |
| 7708396 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-XM_019995238.1 1415786 | 20 | 21455 | 21501 | Amphimedon queenslandica 400682 | GAG\|GTATGAGTAT...ATTACTATAATT/ATTACTATAATT...TGTAG\|TTT | 0 | 1 | 83.033 |
| 7708397 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-XM_019995238.1 1415786 | 21 | 21279 | 21335 | Amphimedon queenslandica 400682 | CAA\|GTAAGTGCAT...TATCATGTGACT/CATGTGATCATC...TGTAG\|AAT | 2 | 1 | 86.284 |
| 7708398 | GT-AG | 0 | 1.000000099473604e-05 | 317 | rna-XM_019995238.1 1415786 | 22 | 20861 | 21177 | Amphimedon queenslandica 400682 | AAC\|GTCAGTATAT...ATTATTTTATAT/CATGTATTTATA...ATTAG\|GGC | 1 | 1 | 89.044 |
| 7708399 | GT-AG | 0 | 1.000000099473604e-05 | 662 | rna-XM_019995238.1 1415786 | 23 | 20068 | 20729 | Amphimedon queenslandica 400682 | AAG\|GTAATTCATG...GGTTCTTTTACA/GGTTCTTTTACA...TGTAG\|CTG | 0 | 1 | 92.623 |
| 7708400 | GT-AG | 0 | 1.315753954604119e-05 | 142 | rna-XM_019995238.1 1415786 | 24 | 19878 | 20019 | Amphimedon queenslandica 400682 | AAG\|GTATGATAAT...CTCTCCCTCTCT/CTCTCTCTCTCT...CTTAG\|AGT | 0 | 1 | 93.934 |
| 7708401 | GT-AG | 0 | 1.000000099473604e-05 | 66 | rna-XM_019995238.1 1415786 | 25 | 19693 | 19758 | Amphimedon queenslandica 400682 | CAA\|GTGATTAAAA...ACTGTTTTAATT/ACTGTTTTAATT...TTTAG\|AGA | 2 | 1 | 97.186 |
| 7708402 | GT-AG | 0 | 1.000000099473604e-05 | 229 | rna-XM_019995238.1 1415786 | 26 | 19392 | 19620 | Amphimedon queenslandica 400682 | AAG\|GTCAGTCATT...ATACTCATATAT/TGTATACTCATA...TGTAG\|GTT | 2 | 1 | 99.153 |
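The filter `transcript_id = 1415786` used for the table above can be reproduced against a local copy of the data. The sketch below is illustrative only: it builds an in-memory SQLite table with a reduced column set and four rows copied from the table, then counts introns per phase and finds the longest intron in the sample. The reduced schema and the variable names (`sample`, `phase_counts`, `longest`) are assumptions made for the example, and the results describe the four sampled rows, not all 26.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is kept.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER, phase INTEGER)"
)

# (id, transcript_id, ordinal_index, length, phase) copied from four listed rows
sample = [
    (7708377, 1415786, 1, 81, 1),
    (7708382, 1415786, 6, 74, 0),
    (7708385, 1415786, 9, 989, 0),
    (7708397, 1415786, 21, 57, 2),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample)

# Count introns per phase for the transcript (parameterized filter).
phase_counts = dict(conn.execute(
    "SELECT phase, COUNT(*) FROM introns WHERE transcript_id = ? GROUP BY phase",
    (1415786,),
).fetchall())

# Longest intron among the sampled rows.
longest = conn.execute(
    "SELECT id, length FROM introns WHERE transcript_id = ? "
    "ORDER BY length DESC LIMIT 1",
    (1415786,),
).fetchone()

print(phase_counts, longest)
```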
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
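An index such as `idx_introns_transcript_id` lets SQLite answer a `transcript_id = ?` filter with an index search instead of a full table scan. The following sketch checks this with `EXPLAIN QUERY PLAN` on an empty in-memory table. The three-column schema is a simplification assumed for the example, and the exact wording of the plan text varies between SQLite versions, so only the index name is inspected.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Simplified schema (an assumption for this sketch); the index name matches the one above.
conn.executescript("""
CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, score REAL);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# Each plan row has four columns; the last one is the human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id, score FROM introns WHERE transcript_id = ?",
    (1415786,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)

uses_index = "idx_introns_transcript_id" in plan_text
```

If `uses_index` is true, the planner chose the index for the lookup. The same check can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`), although the planner may prefer a scan for low-selectivity columns once real data and statistics are present.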