introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
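The column descriptions above suggest two consistency checks that can be run on any row. If `start` and `end` are inclusive coordinates, `length` should equal `end - start + 1`. If the `|` characters in `scored_motifs` mark the exon/intron boundaries, the bases just inside them should reproduce `dinucleotide_pair`. The following is a minimal Python sketch using two rows shown on this page; the helper names are hypothetical, and the reading of the `scored_motifs` layout is an assumption rather than something documented here.

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive e.g. 'GT-AG' from a scored_motifs string.

    Assumed layout (not confirmed by the page):
    upstream_exon|intron_start...middle.../...intron_end|downstream_exon
    """
    _upstream, intron, _downstream = scored_motifs.split("|")
    return f"{intron[:2]}-{intron[-2:]}"


# Two rows copied from the table on this page (subset of columns).
rows = [
    {
        "id": 120104977,
        "start": 46146877,
        "end": 46147441,
        "length": 565,
        "dinucleotide_pair": "GT-AG",
        "scored_motifs": "CAG|GTATTTTCTC...TTTTTCTTTTCT/GAAGATTTAATT...TGCAG|CCA",
    },
    {
        "id": 120104981,
        "start": 46142025,
        "end": 46143092,
        "length": 1068,
        "dinucleotide_pair": "GT-AG",
        "scored_motifs": "GCG|GTGAGTGCAG...TTTTTCTTCTGT/TACTTACAGACA...TAAAG|AGC",
    },
]

# Collect (id, length_ok, dinucleotides_ok) for each row.
checks = [
    (
        row["id"],
        inclusive_length(row["start"], row["end"]) == row["length"],
        terminal_dinucleotides(row["scored_motifs"]) == row["dinucleotide_pair"],
    )
    for row in rows
]
print(checks)
```

Both checks hold for the two rows used here; that is consistent with the assumptions but does not prove them for the whole table.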
30 rows where transcript_id = 22173172
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 120104977 | GT-AG | 0 | 0.0140555991890559 | 565 | rna-XM_036383046.1 22173172 | 2 | 46146877 | 46147441 | Molothrus ater 84834 | CAG\|GTATTTTCTC...TTTTTCTTTTCT/GAAGATTTAATT...TGCAG\|CCA | 0 | 1 | 7.487 |
| 120104978 | GT-AG | 0 | 1.000000099473604e-05 | 798 | rna-XM_036383046.1 22173172 | 3 | 46145989 | 46146786 | Molothrus ater 84834 | ATG\|GTAAGTAGAT...AATTCCTAATTA/CAATTCCTAATT...TACAG\|AAT | 0 | 1 | 9.772 |
| 120104979 | GT-AG | 0 | 1.000000099473604e-05 | 1032 | rna-XM_036383046.1 22173172 | 4 | 46144879 | 46145910 | Molothrus ater 84834 | CAG\|GTAATGGATG...ACCTCTTTAATT/ACCTCTTTAATT...CACAG\|GTC | 0 | 1 | 11.751 |
| 120104980 | GT-AG | 0 | 1.000000099473604e-05 | 1502 | rna-XM_036383046.1 22173172 | 5 | 46143275 | 46144776 | Molothrus ater 84834 | CAG\|GTAAAAGGAA...ATGGCTTTGACT/ATGGCTTTGACT...ATTAG\|AGA | 0 | 1 | 14.34 |
| 120104981 | GT-AG | 0 | 1.000000099473604e-05 | 1068 | rna-XM_036383046.1 22173172 | 6 | 46142025 | 46143092 | Molothrus ater 84834 | GCG\|GTGAGTGCAG...TTTTTCTTCTGT/TACTTACAGACA...TAAAG\|AGC | 2 | 1 | 18.959 |
| 120104982 | GT-AG | 0 | 0.0007623633151933 | 139 | rna-XM_036383046.1 22173172 | 7 | 46141789 | 46141927 | Molothrus ater 84834 | CAA\|GTAAGCTTGG...ATAGCCTTCTTT/AGTTAACTAATA...TCCAG\|TCT | 0 | 1 | 21.421 |
| 120104983 | GT-AG | 0 | 0.0003536929216894 | 1461 | rna-XM_036383046.1 22173172 | 8 | 46140199 | 46141659 | Molothrus ater 84834 | AAA\|GTAGGTTTCT...AAGTTTTTACTT/AAAGTTTTTACT...TTCAG\|GTT | 0 | 1 | 24.695 |
| 120104984 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_036383046.1 22173172 | 9 | 46139935 | 46140093 | Molothrus ater 84834 | AAG\|GTTGGTTAAG...TTTATCTAATCT/GTGTTACTAATT...TACAG\|CAG | 0 | 1 | 27.36 |
| 120104985 | GT-AG | 0 | 1.000000099473604e-05 | 1219 | rna-XM_036383046.1 22173172 | 10 | 46138624 | 46139842 | Molothrus ater 84834 | CAG\|GTAGGAAGGA...GGTTTTTTGTCT/TCAAAACTGATT...ACCAG\|ATG | 2 | 1 | 29.695 |
| 120104986 | GT-AG | 0 | 1.000000099473604e-05 | 683 | rna-XM_036383046.1 22173172 | 11 | 46137823 | 46138505 | Molothrus ater 84834 | GAG\|GTGAGTGAGG...TTTTTTTTATAA/GTTTTTTTTATA...TCTAG\|GTG | 0 | 1 | 32.69 |
| 120104987 | GT-AG | 0 | 0.0003339743014833 | 3000 | rna-XM_036383046.1 22173172 | 12 | 46134700 | 46137699 | Molothrus ater 84834 | GTG\|GTATGTAACA...TTCTTTTTGTTC/TGCCTTGTGAAC...TGCAG\|TAC | 0 | 1 | 35.812 |
| 120104988 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_036383046.1 22173172 | 13 | 46134266 | 46134573 | Molothrus ater 84834 | GAG\|GTAAGAGAAA...TGGGTTTTTTTT/GAAAAATACATT...TGCAG\|GTG | 0 | 1 | 39.01 |
| 120104989 | GT-AG | 0 | 0.0001054955819183 | 1780 | rna-XM_036383046.1 22173172 | 14 | 46132319 | 46134098 | Molothrus ater 84834 | GAG\|GTAGCATGAA...AGCACTTGGATG/TGCAGTCTGAAG...TTCAG\|TGT | 2 | 1 | 43.249 |
| 120104990 | GT-AG | 0 | 0.0704974738010871 | 1842 | rna-XM_036383046.1 22173172 | 15 | 46130329 | 46132170 | Molothrus ater 84834 | CAG\|GTATTTTCCT...TTTCCTTTAATG/TTTCCTTTAATG...TACAG\|CTT | 0 | 1 | 47.005 |
| 120104991 | GT-AG | 0 | 1.000000099473604e-05 | 1233 | rna-XM_036383046.1 22173172 | 16 | 46128942 | 46130174 | Molothrus ater 84834 | GTG\|GTAAGATATG...TCTCCCTTATTC/CCCTTATTCATT...GAAAG\|AAA | 1 | 1 | 50.914 |
| 120104992 | GT-AG | 0 | 1.000000099473604e-05 | 586 | rna-XM_036383046.1 22173172 | 17 | 46128199 | 46128784 | Molothrus ater 84834 | GGA\|GTAAGACCCG...CATGTTTTATTA/TCATGTTTTATT...TGCAG\|GTA | 2 | 1 | 54.898 |
| 120104993 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_036383046.1 22173172 | 18 | 46127717 | 46128101 | Molothrus ater 84834 | GAG\|GTAAAGTAAC...TCTTTTTTAAAT/TCTTTTTTAAAT...CTAAG\|GTA | 0 | 1 | 57.36 |
| 120104994 | GT-AG | 0 | 0.006906699614694 | 96 | rna-XM_036383046.1 22173172 | 19 | 46127544 | 46127639 | Molothrus ater 84834 | CAG\|GTAGCTATTC...TATACCTTAAGC/TGCTGTTTAATA...CATAG\|CCT | 2 | 1 | 59.315 |
| 120104995 | GT-AG | 0 | 1.000000099473604e-05 | 572 | rna-XM_036383046.1 22173172 | 20 | 46126863 | 46127434 | Molothrus ater 84834 | CAA\|GTGAGTTAAC...TGTCTTGTAACC/CAGAGCCTCACG...CTTAG\|CTC | 0 | 1 | 62.081 |
| 120104996 | GT-AG | 0 | 1.000000099473604e-05 | 1949 | rna-XM_036383046.1 22173172 | 21 | 46124794 | 46126742 | Molothrus ater 84834 | GTG\|GTGAGTCTTC...TTCCCCTTGTTT/AAATGCATCATT...CATAG\|GAG | 0 | 1 | 65.127 |
| 120104997 | GT-AG | 0 | 1.000000099473604e-05 | 6909 | rna-XM_036383046.1 22173172 | 22 | 46117685 | 46124593 | Molothrus ater 84834 | CAG\|GTTAGCAAGG...TTCTTTTTATTT/CTTCTTTTTATT...TATAG\|ATT | 2 | 1 | 70.203 |
| 120104998 | GT-AG | 0 | 1.000000099473604e-05 | 10875 | rna-XM_036383046.1 22173172 | 23 | 46106722 | 46117596 | Molothrus ater 84834 | AAG\|GTTGGATTTG...TTTTCATTAATC/AAAATTTTCATT...TTTAG\|GCA | 0 | 1 | 72.437 |
| 120104999 | GT-AG | 0 | 0.0051235374329329 | 1531 | rna-XM_036383046.1 22173172 | 24 | 46105047 | 46106577 | Molothrus ater 84834 | CTG\|GTATGCTAAT...AGTTTTGTAGTT/TTGTAGTTCAAC...CTTAG\|TTA | 0 | 1 | 76.091 |
| 120105000 | GT-AG | 0 | 1.000000099473604e-05 | 8735 | rna-XM_036383046.1 22173172 | 25 | 46096221 | 46104955 | Molothrus ater 84834 | CAG\|GTTAGTTTGT...TTTTTTTTCGCT/TGGCTTCTGAGT...TTCAG\|GTA | 1 | 1 | 78.401 |
| 120105001 | GT-AG | 0 | 1.000000099473604e-05 | 2316 | rna-XM_036383046.1 22173172 | 26 | 46093774 | 46096089 | Molothrus ater 84834 | CTG\|GTAGGTGCCT...CTGTCCTAAATG/TCTGTCCTAAAT...GCCAG\|TCT | 0 | 1 | 81.726 |
| 120105002 | GT-AG | 0 | 1.000000099473604e-05 | 1118 | rna-XM_036383046.1 22173172 | 27 | 46092583 | 46093700 | Molothrus ater 84834 | AAG\|GTATGGGCAA...CACTTTTTAAAA/TGATTGCTCACT...TTCAG\|AGG | 1 | 1 | 83.579 |
| 120105003 | GT-AG | 0 | 0.0009676271436461 | 1090 | rna-XM_036383046.1 22173172 | 28 | 46091398 | 46092487 | Molothrus ater 84834 | CAG\|GTAATCTTTC...TGGCTTTTACCA/CAGGATTTTACT...TTCAG\|CAA | 0 | 1 | 85.99 |
| 120105004 | GT-AG | 0 | 0.0001785223959755 | 895 | rna-XM_036383046.1 22173172 | 29 | 46090264 | 46091158 | Molothrus ater 84834 | GGA\|GTACGTGCTG...TGTGTTTTGAAG/TGTGTTTTGAAG...TGCAG\|GCA | 2 | 1 | 92.056 |
| 120105005 | GT-AG | 0 | 1.000000099473604e-05 | 455 | rna-XM_036383046.1 22173172 | 30 | 46089661 | 46090115 | Molothrus ater 84834 | GTG\|GTGAGTTGAA...GTTCTTTTGATG/GTTCTTTTGATG...TGCAG\|CCT | 0 | 1 | 95.812 |
| 120112065 | GT-AG | 0 | 1.000000099473604e-05 | 20909 | rna-XM_036383046.1 22173172 | 1 | 46147566 | 46168474 | Molothrus ater 84834 | CGG\|GTGAGCTGCG...ATCTACATATCT/TCTACATTTATG...ACCAG\|GTG |  | 0 | 4.518 |
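The view above corresponds to a simple filter on `transcript_id`. The following is a minimal sketch in Python with the standard-library `sqlite3` module, run against an in-memory table that holds a cut-down copy of three rows from this page. Only a subset of columns is kept, and one extra row, clearly hypothetical, is added so that the filter has something to exclude. No database file name is given on this page, so none is assumed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns table: only the columns used below.
conn.execute(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "length" INTEGER
    )
    """
)

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        # Rows copied from this page.
        (120104977, 22173172, 2, 46146877, 46147441, 565),
        (120104978, 22173172, 3, 46145989, 46146786, 798),
        (120112065, 22173172, 1, 46147566, 46168474, 20909),
        # HYPOTHETICAL row for a different transcript; not real data.
        (1, 999, 1, 100, 199, 100),
    ],
)

# Same filter as the page, but ordered by position within the transcript
# rather than by id, so the first intron comes first.
rows_for_transcript = conn.execute(
    """
    SELECT "id", "ordinal_index", "start", "end", "length"
    FROM introns
    WHERE "transcript_id" = ?
    ORDER BY "ordinal_index"
    """,
    (22173172,),
).fetchall()

for row in rows_for_transcript:
    print(row)
```

Ordering by `ordinal_index` matters here because the intron with `ordinal_index` 1 has the highest `id` of the set, so the default sort by `id` lists it last.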
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
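The indexes above are what allow a filter such as `transcript_id = 22173172` to avoid a full table scan. One way to check this, sketched in Python with `sqlite3`, is to build an empty copy of the table, create one of the indexes, and ask SQLite for its query plan. The foreign-key clauses are omitted in this sketch because the `transcripts` and `genomes` tables are not shown on this page. The wording of the plan text varies between SQLite versions, so the sketch only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Empty copy of the schema shown above, minus the FOREIGN KEY clauses
# (the referenced tables are not part of this sketch).
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# EXPLAIN QUERY PLAN returns one row per plan step; the human-readable
# description is the fourth column.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (22173172,),
).fetchall()

plan_text = " ".join(step[3] for step in plan)
uses_index = "idx_introns_transcript_id" in plan_text

print(plan_text)
print("uses idx_introns_transcript_id:", uses_index)
```

The same pattern can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`) by creating the matching index and changing the `WHERE` clause.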