home / WtMTA

introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

50 rows where transcript_id = 35103464

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: dinucleotide_pair, score, phase

id ▼ dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position
197656830 GT-AG 0 0.0002556852284741 703 rna-XM_007052123.2 35103464 1 35826441 35827143 Theobroma cacao 3641 CAG|GTTCTTTCTT...ATTTTCTTAATG/AATTTTCTTAAT...GGCAG|TTC 0 1 1.367
197656831 GT-AG 0 1.000000099473604e-05 115 rna-XM_007052123.2 35103464 2 35826028 35826142 Theobroma cacao 3641 GCG|GTAAGTTGCA...ACACTGTTGATC/ACACTGTTGATC...TTTAG|GGA 1 1 5.482
197656832 GT-AG 0 2.3261867636480636e-05 166 rna-XM_007052123.2 35103464 3 35825633 35825798 Theobroma cacao 3641 AAG|GTACTGATTT...TGTTCCTTTTCT/CATCAACTTAAA...TGCAG|GAA 2 1 8.644
197656833 GT-AG 0 1.000000099473604e-05 108 rna-XM_007052123.2 35103464 4 35825185 35825292 Theobroma cacao 3641 CTT|GTGAGTTATG...CTGTTTTTAGAA/ACTGTTTTTAGA...TTCAG|GCT 0 1 13.339
197656834 GT-AG 0 0.0005713537148676 114 rna-XM_007052123.2 35103464 5 35824957 35825070 Theobroma cacao 3641 CAG|GTATTATTGT...ATTTGTTTAATT/TTATTTTTCATT...TGCAG|GAA 0 1 14.913
197656835 GT-AG 0 1.000000099473604e-05 83 rna-XM_007052123.2 35103464 6 35824745 35824827 Theobroma cacao 3641 CTG|GTAAGAGCTC...TTCTTCTTCTCG/ATACTGTTCAAT...TGCAG|GCC 0 1 16.694
197656836 GT-AG 0 0.0011601316155007 98 rna-XM_007052123.2 35103464 7 35824455 35824552 Theobroma cacao 3641 AAT|GTACATATCT...GCATCTTTGTTT/TTTGGGTTGATA...TGCAG|ACG 0 1 19.345
197656837 GT-AG 0 0.0001362805256685 267 rna-XM_007052123.2 35103464 8 35823993 35824259 Theobroma cacao 3641 AAG|GTAAATTTGG...TGTATCTTATCA/TTGTATCTTATC...TGCAG|ATT 0 1 22.038
197656838 GT-AG 0 1.000000099473604e-05 101 rna-XM_007052123.2 35103464 9 35823754 35823854 Theobroma cacao 3641 GAG|GTAATATCGA...GAGCTATTAGTT/TTTGGGTTGACA...TTCAG|GAG 0 1 23.944
197656839 GT-AG 0 0.0053868215525492 90 rna-XM_007052123.2 35103464 10 35823541 35823630 Theobroma cacao 3641 AAG|GTATTATTTC...ATGTTTTTATCT/CATGTTTTTATC...TTCAG|GTC 0 1 25.642
197656840 GC-AG 0 1.000000099473604e-05 290 rna-XM_007052123.2 35103464 11 35822997 35823286 Theobroma cacao 3641 AAG|GCATGTTGTT...GGATGTTTAATA/GGATGTTTAATA...GACAG|GGA 2 1 29.149
197656841 GT-AG 0 1.000000099473604e-05 503 rna-XM_007052123.2 35103464 12 35822390 35822892 Theobroma cacao 3641 TTG|GTGAGTTATT...TTCTCTTTGAAC/TTTGAACTTATG...TTCAG|GCT 1 1 30.585
197656842 GT-AG 0 1.000000099473604e-05 126 rna-XM_007052123.2 35103464 13 35822169 35822294 Theobroma cacao 3641 AAA|GTTAGTGTTA...AATTCTCTAATT/AATTCTCTAATT...TACAG|ATG 0 1 31.897
197656843 GT-AG 0 1.000000099473604e-05 515 rna-XM_007052123.2 35103464 14 35821417 35821931 Theobroma cacao 3641 GAG|GTAAAACCAT...TTTTTTTTATCT/TTTTTTTTTATC...TGCAG|TTA 0 1 35.17
197656844 GT-AG 0 0.0206213383729518 76 rna-XM_007052123.2 35103464 15 35821152 35821227 Theobroma cacao 3641 AAA|GTATGCCTGG...ATATTATTACCA/CTAGAACTAATA...TGCAG|CTG 0 1 37.78
197656845 GT-AG 0 3.781740705219119e-05 143 rna-XM_007052123.2 35103464 16 35820927 35821069 Theobroma cacao 3641 CTA|GTAATAATTT...TTTTTTTTAAAT/TTTTTTTTAAAT...TACAG|GAT 1 1 38.912
197656846 GT-AG 0 0.0038832959309143 865 rna-XM_007052123.2 35103464 17 35819994 35820858 Theobroma cacao 3641 GAG|GTATTTAATG...CTTCTCTTAATC/CTTCTCTTAATC...TACAG|GCT 0 1 39.851
197656847 GT-AG 0 1.000000099473604e-05 115 rna-XM_007052123.2 35103464 18 35819733 35819847 Theobroma cacao 3641 AAG|GTCAAACTTT...ATTAACTTAATC/ATTGAATTAACT...TGCAG|AGC 2 1 41.867
197656848 GT-AG 0 0.0033234741122315 89 rna-XM_007052123.2 35103464 19 35819535 35819623 Theobroma cacao 3641 AAG|GTTTCTGTCT...ATATATTTGACC/AAAATATTTATT...GTTAG|GTT 0 1 43.372
197656849 GT-AG 0 0.0006237376938812 150 rna-XM_007052123.2 35103464 20 35819238 35819387 Theobroma cacao 3641 GAG|GTACTTGTTT...TTGTTCTTATTT/TTTGTTCTTATT...TCCAG|GCA 0 1 45.402
197656850 GT-AG 0 0.0003819493419297 81 rna-XM_007052123.2 35103464 21 35819109 35819189 Theobroma cacao 3641 AAG|GTATGATTAT...TTGTTCTGAACT/CTTGTTCTGAAC...TGCAG|ATT 0 1 46.065
197656851 GT-AG 0 2.6782989360885656e-05 127 rna-XM_007052123.2 35103464 22 35818853 35818979 Theobroma cacao 3641 GAG|GTATTGACAT...TGTGTTTTATTG/CTGTGTTTTATT...TTTAG|GTA 0 1 47.846
197656852 GT-AG 0 0.0002429657016225 81 rna-XM_007052123.2 35103464 23 35818532 35818612 Theobroma cacao 3641 CAG|GTTTGTTTCT...TTTTCTTTGAGT/TTTTCTTTGAGT...TTTAG|TAT 0 1 51.16
197656853 GT-AG 0 6.416170590385922e-05 96 rna-XM_007052123.2 35103464 24 35818295 35818390 Theobroma cacao 3641 CAG|GTTGTCTTTT...GAATTATTAACT/TTTTTGTTTATT...TGTAG|CTT 0 1 53.107
197656854 GT-AG 0 1.000000099473604e-05 89 rna-XM_007052123.2 35103464 25 35818120 35818208 Theobroma cacao 3641 GAG|GTTTGATGCC...TTGGTCTTGGCG/GTTGTATACATT...TGCAG|GGT 2 1 54.294
197656855 GT-AG 0 1.116903856471134e-05 483 rna-XM_007052123.2 35103464 26 35817531 35818013 Theobroma cacao 3641 AAG|GTCTGTAAAG...CTATTCTTAGCT/ATCATTTTAATC...TGTAG|GAT 0 1 55.758
197656856 GT-AG 0 1.000000099473604e-05 128 rna-XM_007052123.2 35103464 27 35817307 35817434 Theobroma cacao 3641 AAG|GTTTTGTCAC...ATGGTTTTAATC/ATGGTTTTAATC...TCTAG|GAA 0 1 57.084
197656857 GT-AG 0 1.8350440570490347e-05 428 rna-XM_007052123.2 35103464 28 35816726 35817153 Theobroma cacao 3641 AAG|GTCAGCTTGT...TTCCCTTTGACT/CTGGTTCTAACT...TATAG|GCA 0 1 59.196
197656858 GT-AG 0 0.0031962225703904 88 rna-XM_007052123.2 35103464 29 35816464 35816551 Theobroma cacao 3641 GAG|GTATTCATAT...TTAGTTTTGAGA/TTAGTTTTGAGA...TTTAG|GAC 0 1 61.599
197656859 GT-AG 0 1.000000099473604e-05 87 rna-XM_007052123.2 35103464 30 35816162 35816248 Theobroma cacao 3641 GAG|GTGTTACATT...TCTATTTGAACT/TTGTTGCTGATC...TTCAG|TGC 2 1 64.568
197656860 GT-AG 0 1.000000099473604e-05 84 rna-XM_007052123.2 35103464 31 35815942 35816025 Theobroma cacao 3641 AAA|GTAAGAAATT...TGCTTTTTACTA/TTTTTACTAATA...GCTAG|GAA 0 1 66.446
197656861 GT-AG 0 1.000000099473604e-05 528 rna-XM_007052123.2 35103464 32 35815288 35815815 Theobroma cacao 3641 AAG|GTTGAATATG...ATGTTTCTGACT/ATGTTTCTGACT...TTCAG|TTG 0 1 68.186
197656862 GT-AG 0 0.3463242752350556 242 rna-XM_007052123.2 35103464 33 35815001 35815242 Theobroma cacao 3641 CAG|GTATGCTTTA...TAATTCTTGACA/TGTTATTTAATT...TGTAG|GGA 0 1 68.807
197656863 GT-AG 0 0.0024382912445282 95 rna-XM_007052123.2 35103464 34 35814828 35814922 Theobroma cacao 3641 AAG|GTGTTCTAAT...TCATCCTTGACT/ATGATTCTCATC...TACAG|GTT 0 1 69.884
197656864 GT-AG 0 0.0027133028165441 418 rna-XM_007052123.2 35103464 35 35814293 35814710 Theobroma cacao 3641 TGG|GTATGTTCTA...TCTGTGTTGATT/TCTGTGTTGATT...TACAG|GTG 0 1 71.5
197656865 GT-AG 0 0.0195274250924256 76 rna-XM_007052123.2 35103464 36 35814090 35814165 Theobroma cacao 3641 ATA|GTATGTATTA...AAATTCTAAATA/AAAATTCTAAAT...TGCAG|AAG 1 1 73.253
197656866 GT-AG 0 1.5929483704946932e-05 126 rna-XM_007052123.2 35103464 37 35813863 35813988 Theobroma cacao 3641 AAG|GTTTGTCATC...TCTCTCTTATTT/CTTCTGTTCATT...TGCAG|GTT 0 1 74.648
197656867 GT-AG 0 1.000000099473604e-05 81 rna-XM_007052123.2 35103464 38 35813644 35813724 Theobroma cacao 3641 AAG|GTCAATGGTT...TTGTTCTTAAGT/TTTCTTCTGACC...AGCAG|GTA 0 1 76.553
197656868 GT-AG 0 0.0002442625486425 750 rna-XM_007052123.2 35103464 39 35812810 35813559 Theobroma cacao 3641 CAG|GTGACCTGAA...AAATTCTTAGTC/TTCTTAGTCATT...TGTAG|GTT 0 1 77.713
197656869 GT-AG 0 1.000000099473604e-05 110 rna-XM_007052123.2 35103464 40 35812553 35812662 Theobroma cacao 3641 ACA|GTGAATCTCT...CATTTTTTGTTT/GCCAAACTCATT...GTTAG|GAA 0 1 79.743
197656870 GT-AG 0 0.0003950922372057 100 rna-XM_007052123.2 35103464 41 35812321 35812420 Theobroma cacao 3641 AAG|GTCTTGTTAT...TTTTTCTTAATT/TTTTTCTTAATT...TTTAG|TAT 0 1 81.566
197656871 GT-AG 0 1.000000099473604e-05 71 rna-XM_007052123.2 35103464 42 35812202 35812272 Theobroma cacao 3641 AAG|GTGAATATTC...TATGTCTTGATT/TATGTCTTGATT...AACAG|ATT 0 1 82.229
197656872 GT-AG 0 1.000000099473604e-05 72 rna-XM_007052123.2 35103464 43 35811983 35812054 Theobroma cacao 3641 CAG|GTAGGTTCAT...TTGTCATTATTG/CTAATTCTCATT...TGCAG|ATT 0 1 84.258
197656873 GT-AG 0 1.000000099473604e-05 589 rna-XM_007052123.2 35103464 44 35811332 35811920 Theobroma cacao 3641 CAG|GTAAGAATTG...CTTATGTTGATT/CTTATGTTGATT...TGTAG|CTT 2 1 85.115
197656874 GT-AG 0 0.0003757921650923 95 rna-XM_007052123.2 35103464 45 35811083 35811177 Theobroma cacao 3641 CCG|GTTTGCCTCT...TTATTCTTTCTT/ATCCCACTTATT...TGTAG|GTT 0 1 87.241
197656875 GT-AG 0 0.00074883340088 76 rna-XM_007052123.2 35103464 46 35810815 35810890 Theobroma cacao 3641 AAG|GTGTCAATTG...GATTGTTTAATT/GATTGTTTAATT...CGCAG|ATT 0 1 89.892
197656876 GT-AG 0 0.0072102130767594 241 rna-XM_007052123.2 35103464 47 35810469 35810709 Theobroma cacao 3641 AAG|GTACCTCAAT...CCTTCCTAAATT/TGTTTGCTCAAA...TGCAG|ACA 0 1 91.342
197656877 GT-AG 0 0.0066448772216664 267 rna-XM_007052123.2 35103464 48 35810064 35810330 Theobroma cacao 3641 CAG|GTATTCTAGA...CACTGTTTAATT/ATTAACTTCACT...TTTAG|GCC 0 1 93.248
197656878 GT-AG 0 1.000000099473604e-05 93 rna-XM_007052123.2 35103464 49 35809734 35809826 Theobroma cacao 3641 CAG|GTGAGAATTT...TATACTTTGCTT/CTGCTGCTCATT...ATCAG|GAA 0 1 96.52
197656879 GT-AG 0 1.000000099473604e-05 84 rna-XM_007052123.2 35103464 50 35809548 35809631 Theobroma cacao 3641 AAG|GTAAAAATTA...GTCCCCTTGAGG/AGGTTGCTCATG...TTCAG|AAT 0 1 97.929

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
Powered by Datasette · Queries took 61.614ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)