introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
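The column definitions above can be mirrored in a typed record when rows are processed outside the database. The following is a minimal sketch, assuming rows arrive as 14-value tuples in table column order; the `Intron` class and `intron_from_row` helper are hypothetical names introduced for illustration and are not part of the database.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical typed view of one `introns` row (not part of the database)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # probability of being minor, 0-100
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # 0, 1, or 2; None outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float


def intron_from_row(row):
    """Build an Intron from a 14-value tuple in table column order."""
    values = list(row)
    values[2] = bool(values[2])    # is_minor
    values[12] = bool(values[12])  # in_cds
    return Intron(*values)


# Values copied from the first row listed for transcript 9059433.
first = intron_from_row((
    48954661, "GT-AG", 0, 1.000000099473604e-05, 301, 9059433, 1,
    26798529, 26798829, 42526,
    "CAG|GTAAGTCTGT...ATGTGTTTAAAA/ATGTGTTTAAAA...TGCAG|GAA",
    1, 1, 0.165,
))
print(first.is_minor, first.phase, first.in_cds)
```

Keeping `phase` as `Optional[int]` preserves the distinction between phase 0 and a null phase, which a plain integer conversion would lose.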
36 rows where transcript_id = 9059433
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48954661 | GT-AG | 0 | 1.000000099473604e-05 | 301 | rna-XM_036566144.1 9059433 | 1 | 26798529 | 26798829 | Colossoma macropomum 42526 | CAG\|GTAAGTCTGT...ATGTGTTTAAAA/ATGTGTTTAAAA...TGCAG\|GAA | 1 | 1 | 0.165 |
| 48954662 | GT-AG | 0 | 1.000000099473604e-05 | 984 | rna-XM_036566144.1 9059433 | 2 | 26798867 | 26799850 | Colossoma macropomum 42526 | TGG\|GTAAGGAGTC...TGTGTTTTGTTT/TTGTTTTTCCTC...CTCAG\|GGG | 2 | 1 | 1.037 |
| 48954663 | GT-AG | 0 | 1.000000099473604e-05 | 10063 | rna-XM_036566144.1 9059433 | 3 | 26799905 | 26809967 | Colossoma macropomum 42526 | CTG\|GTGAGAGAGG...TGATTTTTGTAT/GTGGTACTCAGA...TTCAG\|GAG | 2 | 1 | 2.31 |
| 48954664 | GT-AG | 0 | 1.000000099473604e-05 | 2442 | rna-XM_036566144.1 9059433 | 4 | 26810152 | 26812593 | Colossoma macropomum 42526 | AAT\|GTGAGTCTTT...GCATTTTTGCCT/TGAAGGTTCATT...TGCAG\|GAT | 0 | 1 | 6.648 |
| 48954665 | GT-AG | 0 | 1.000000099473604e-05 | 1122 | rna-XM_036566144.1 9059433 | 5 | 26812729 | 26813850 | Colossoma macropomum 42526 | GAA\|GTGAGTCATC...TAAATCATGAAG/GCGGAGCTGAAT...CTCAG\|CTG | 0 | 1 | 9.83 |
| 48954666 | GT-AG | 0 | 1.000000099473604e-05 | 992 | rna-XM_036566144.1 9059433 | 6 | 26813956 | 26814947 | Colossoma macropomum 42526 | TTG\|GTAAGACATA...GAAACCGTACTA/AACATTTCCACT...TCCAG\|GGC | 0 | 1 | 12.306 |
| 48954667 | GT-AG | 0 | 1.000000099473604e-05 | 810 | rna-XM_036566144.1 9059433 | 7 | 26815048 | 26815857 | Colossoma macropomum 42526 | TTG\|GTAATGGCTG...TGAGCCTTGTTG/TCGAATCTCATT...TCCAG\|GTG | 1 | 1 | 14.663 |
| 48954668 | GT-AG | 0 | 1.000000099473604e-05 | 2350 | rna-XM_036566144.1 9059433 | 8 | 26815935 | 26818284 | Colossoma macropomum 42526 | GAG\|GTTTGTATCT...TGTCTCTGTGCT/GGTCTGTTTATG...TGCAG\|GTG | 0 | 1 | 16.478 |
| 48954669 | GT-AG | 0 | 1.000000099473604e-05 | 12636 | rna-XM_036566144.1 9059433 | 9 | 26818431 | 26831066 | Colossoma macropomum 42526 | ACG\|GTGAGACACC...CTGACCTTATTC/TCTGTTCTGACC...CACAG\|AAA | 2 | 1 | 19.92 |
| 48954670 | GT-AG | 0 | 9.141036785558363e-05 | 3808 | rna-XM_036566144.1 9059433 | 10 | 26831133 | 26834940 | Colossoma macropomum 42526 | TCA\|GTAAGTACTG...TTGCTCTTACTC/CTTACTCTCATT...TGCAG\|GTG | 2 | 1 | 21.476 |
| 48954671 | GT-AG | 0 | 0.0005914143862696 | 3966 | rna-XM_036566144.1 9059433 | 11 | 26835097 | 26839062 | Colossoma macropomum 42526 | CAA\|GTATGTGCCT...CACTCTGTAACT/CTGTAACTGAAA...CACAG\|GCA | 2 | 1 | 25.153 |
| 48954672 | GT-AG | 0 | 1.000000099473604e-05 | 214 | rna-XM_036566144.1 9059433 | 12 | 26839148 | 26839361 | Colossoma macropomum 42526 | GAG\|GTTAGTTATT...TATGTCTTGGTA/TTATGTGTAATT...CACAG\|GAA | 0 | 1 | 27.157 |
| 48954673 | GT-AG | 0 | 1.000000099473604e-05 | 1306 | rna-XM_036566144.1 9059433 | 13 | 26839479 | 26840784 | Colossoma macropomum 42526 | CAG\|GTCAGTGCAG...TAAAGTTTATTT/TTAAAGTTTATT...TACAG\|TTC | 0 | 1 | 29.915 |
| 48954674 | GT-AG | 0 | 0.0012659682865822 | 186 | rna-XM_036566144.1 9059433 | 14 | 26840890 | 26841075 | Colossoma macropomum 42526 | CAG\|GTACCTACCT...CTTGTCTTTCTG/GAATGTGTTATA...TTTAG\|TGC | 0 | 1 | 32.39 |
| 48954675 | GT-AG | 0 | 1.000000099473604e-05 | 1342 | rna-XM_036566144.1 9059433 | 15 | 26841160 | 26842501 | Colossoma macropomum 42526 | AAG\|GTATGACAGC...AAAATGTTAACA/TTAACACTCATA...TGCAG\|GCA | 0 | 1 | 34.371 |
| 48954676 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-XM_036566144.1 9059433 | 16 | 26842705 | 26842845 | Colossoma macropomum 42526 | CAC\|GTGAGAAATG...CATTTCTTGATC/CATTTCTTGATC...GGCAG\|AGG | 2 | 1 | 39.156 |
| 48954677 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_036566144.1 9059433 | 17 | 26842945 | 26843032 | Colossoma macropomum 42526 | CAG\|GTCTGAAACT...CAGATATTGATC/CTAGTGCTCACA...TCCAG\|GTA | 2 | 1 | 41.49 |
| 48954678 | GT-AG | 0 | 9.935816720368894e-05 | 82 | rna-XM_036566144.1 9059433 | 18 | 26843148 | 26843229 | Colossoma macropomum 42526 | GAG\|GTATAGTATG...TTATTTTCAATC/TTTATTTTCAAT...TACAG\|GAG | 0 | 1 | 44.201 |
| 48954679 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-XM_036566144.1 9059433 | 19 | 26843357 | 26843458 | Colossoma macropomum 42526 | ATG\|GTAAGTTCAG...CATGCGTTGACC/TGCTTTCTAACT...TGCAG\|CGG | 1 | 1 | 47.195 |
| 48954680 | GT-AG | 0 | 1.000000099473604e-05 | 3706 | rna-XM_036566144.1 9059433 | 20 | 26843670 | 26847375 | Colossoma macropomum 42526 | CTG\|GTAAGTGGAT...ATTAGCTTAATG/GTTGTGTTCATG...TGAAG\|TGC | 2 | 1 | 52.169 |
| 48954681 | GT-AG | 0 | 1.000000099473604e-05 | 1995 | rna-XM_036566144.1 9059433 | 21 | 26847524 | 26849518 | Colossoma macropomum 42526 | CAG\|GTGAGACACT...AATACCTGACAG/TGACAGCTAACA...CTCAG\|ATA | 0 | 1 | 55.658 |
| 48954682 | GT-AG | 0 | 5.971637810905386e-05 | 193 | rna-XM_036566144.1 9059433 | 22 | 26849673 | 26849865 | Colossoma macropomum 42526 | TTG\|GTAACACGCT...GTTTACTTATTC/TGTTTACTTATT...TCTAG\|ACA | 1 | 1 | 59.288 |
| 48954683 | GT-AG | 0 | 1.000000099473604e-05 | 1757 | rna-XM_036566144.1 9059433 | 23 | 26849955 | 26851711 | Colossoma macropomum 42526 | AAG\|GTAACACAGT...TTTTTCTCAAAT/TTTTTTCTCAAA...CAAAG\|GTG | 0 | 1 | 61.386 |
| 48954684 | GT-AG | 0 | 1.000000099473604e-05 | 11505 | rna-XM_036566144.1 9059433 | 24 | 26851842 | 26863346 | Colossoma macropomum 42526 | CAG\|GTATGACCCA...GATGCTTTAAGA/GATAAGCTCATA...ACCAG\|GTA | 1 | 1 | 64.451 |
| 48954685 | GT-AG | 0 | 1.000000099473604e-05 | 4485 | rna-XM_036566144.1 9059433 | 25 | 26863427 | 26867911 | Colossoma macropomum 42526 | AAG\|GTACGTACTC...CATGTGTTAATA/CATGTGTTAATA...TTTAG\|GTG | 0 | 1 | 66.337 |
| 48954686 | GT-AG | 0 | 1.000000099473604e-05 | 812 | rna-XM_036566144.1 9059433 | 26 | 26867990 | 26868801 | Colossoma macropomum 42526 | CGC\|GTGAGTGTCC...GAGTTCTTTCTG/TTCTTTCTGAAG...GGCAG\|TAC | 0 | 1 | 68.175 |
| 48954687 | GT-AG | 0 | 1.000000099473604e-05 | 3156 | rna-XM_036566144.1 9059433 | 27 | 26869008 | 26872163 | Colossoma macropomum 42526 | CAG\|GTAGGAGAGC...TTTTCCTGCTCT/CATCTGTTCAAA...CTCAG\|GTA | 2 | 1 | 73.032 |
| 48954688 | GT-AG | 0 | 1.000000099473604e-05 | 512 | rna-XM_036566144.1 9059433 | 28 | 26872276 | 26872787 | Colossoma macropomum 42526 | AAG\|GTACAGTCAT...GTGTACTTCATG/GTGTACTTCATG...GACAG\|GTG | 0 | 1 | 75.672 |
| 48954689 | GT-AG | 0 | 1.000000099473604e-05 | 784 | rna-XM_036566144.1 9059433 | 29 | 26872951 | 26873734 | Colossoma macropomum 42526 | CAG\|GTACAAAACA...TTCTCTTTCTCT/TCTTCTCTCATA...CGCAG\|CCT | 1 | 1 | 79.514 |
| 48954690 | GT-AG | 0 | 1.000000099473604e-05 | 3710 | rna-XM_036566144.1 9059433 | 30 | 26873816 | 26877525 | Colossoma macropomum 42526 | CAG\|GTACTGACCT...TCTGACTTGAAG/TGGTGTCTGACT...TTCAG\|CAT | 1 | 1 | 81.424 |
| 48954691 | GT-AG | 0 | 4.290769178054906e-05 | 2587 | rna-XM_036566144.1 9059433 | 31 | 26877602 | 26880188 | Colossoma macropomum 42526 | CCG\|GTACGTCCAA...ATATTTTTATTT/TATATTTTTATT...CTCAG\|GGA | 2 | 1 | 83.215 |
| 48954692 | GT-AG | 0 | 1.000000099473604e-05 | 993 | rna-XM_036566144.1 9059433 | 32 | 26880238 | 26881230 | Colossoma macropomum 42526 | GAG\|GTCAGAAATG...TTTCTTTTGACA/TTTCTTTTGACA...TTAAG\|GTT | 0 | 1 | 84.371 |
| 48954693 | GT-AG | 0 | 1.000000099473604e-05 | 22192 | rna-XM_036566144.1 9059433 | 33 | 26881349 | 26903540 | Colossoma macropomum 42526 | AAG\|GTGAGCTTTT...GTGTCGCTGATC/GTGTCGCTGATC...TTCAG\|GTC | 1 | 1 | 87.152 |
| 48954694 | GT-AG | 0 | 1.000000099473604e-05 | 489 | rna-XM_036566144.1 9059433 | 34 | 26903642 | 26904130 | Colossoma macropomum 42526 | AAC\|GTGAGTTCAA...TGCACATTTGTG/CGGGTGCACATT...TACAG\|AGA | 0 | 1 | 89.533 |
| 48954695 | GT-AG | 0 | 1.000000099473604e-05 | 701 | rna-XM_036566144.1 9059433 | 35 | 26904178 | 26904878 | Colossoma macropomum 42526 | TGG\|GTAAATAGCT...TTTTTCTTTTTT/TTTTCTGTCATT...CTCAG\|GCA | 2 | 1 | 90.641 |
| 48961134 | GT-AG | 0 | 1.000000099473604e-05 | 8972 | rna-XM_036566144.1 9059433 | 36 | 26905091 | 26914062 | Colossoma macropomum 42526 | CAG\|GTGAGGGGTT...ATTTCTTTAGTC/TTAGTCCTCATT...CACAG\|GTT | | 0 | 95.639 |
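The coordinate columns in the rows above appear to follow a simple arithmetic relationship: `length` equals `end - start + 1`, which suggests inclusive coordinates. The sketch below checks this on the first three rows and also computes the gap between consecutive introns. Reading that gap as the length of the intervening exon is an assumption (it presumes inclusive coordinates and introns listed in genomic order); the helper names are hypothetical.

```python
# (ordinal_index, start, end, length) copied from the first three table rows.
rows = [
    (1, 26798529, 26798829, 301),
    (2, 26798867, 26799850, 984),
    (3, 26799905, 26809967, 10063),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end are both included."""
    return end - start + 1


def gap_between(prev_end, next_start):
    """Number of bases strictly between two consecutive introns."""
    return next_start - prev_end - 1


# Every listed length matches the inclusive-coordinate formula.
lengths_match = all(
    inclusive_length(start, end) == length for _, start, end, length in rows
)

# Gaps between intron 1 and 2, and between intron 2 and 3.
gaps = [gap_between(a[2], b[1]) for a, b in zip(rows, rows[1:])]

print(lengths_match)  # True
print(gaps)           # [37, 54]
```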
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
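As a rough illustration of how this schema supports the `transcript_id = 9059433` filter, the sketch below loads the table definition into an in-memory SQLite database using Python's standard `sqlite3` module, inserts two rows copied from the table plus one invented placeholder row, and asks SQLite for its query plan. Only the `transcript_id` index is recreated here; the placeholder row (id 1, transcript 1) is made up purely to show that the filter excludes it. The plan wording differs between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
  "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
  "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
  "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
  "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# "start" and "end" are quoted because END is an SQL keyword.
conn.executemany(
    'INSERT INTO introns (id, transcript_id, ordinal_index, "start", "end", length) '
    "VALUES (?, ?, ?, ?, ?, ?)",
    [
        (48954661, 9059433, 1, 26798529, 26798829, 301),  # from the table
        (48954662, 9059433, 2, 26798867, 26799850, 984),  # from the table
        (1, 1, 1, 1, 10, 10),                             # invented placeholder
    ],
)

query = "SELECT id, ordinal_index FROM introns WHERE transcript_id = ? ORDER BY id"
matches = conn.execute(query, (9059433,)).fetchall()

# Each plan row is (id, parent, notused, detail); keep the readable detail text.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (9059433,)).fetchall()
plan_details = [row[3] for row in plan]

print(matches)
print(plan_details)
conn.close()
```

If the plan detail mentions `idx_introns_transcript_id`, the lookup is served by the index rather than a full table scan.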