introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

29 rows where transcript_id = 14424036
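
The filter above corresponds to a plain SQL `WHERE` clause on `transcript_id`. The following is a minimal sketch of that query using Python's standard `sqlite3` module. It assumes nothing about how the real database is distributed: the in-memory table is a hypothetical stand-in holding a subset of columns and three rows copied from this page, and `introns_for_transcript` is an illustrative helper name, not part of the database or of Datasette.

```python
import sqlite3

# Hypothetical in-memory stand-in for the real database: a subset of the
# columns and three of the rows shown on this page.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "score" REAL, "length" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (77101591, 14424036, 1, 0.0184421836802867, 78),
        (77101602, 14424036, 12, 0.939537760418664, 282),
        (77101619, 14424036, 29, 1.000000099473604e-05, 92),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, score, length) rows for one transcript, ordered by id."""
    cur = conn.execute(
        "SELECT id, ordinal_index, score, length FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),  # bound parameter, never string-formatted into the SQL
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 14424036)
```

Against a full local copy of the database, the same function would be expected to return all 29 rows listed on this page; the stand-in here returns only the three that were inserted.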

id dinucleotide_pair is_minor score length transcript_id (label, id) ordinal_index start end taxonomy_id (species, id) scored_motifs phase in_cds relative_position
77101591 GT-AG 0 0.0184421836802867 78 rna-XM_006414302.2 14424036 1 10767939 10768016 Eutrema salsugineum 72664 TTG|GTATGTCTCC...TTACTTTTGATT/TTACTTTTGATT...ACCAG|CTT 0 1 1.581
77101592 GT-AG 0 1.000000099473604e-05 81 rna-XM_006414302.2 14424036 2 10768180 10768260 Eutrema salsugineum 72664 CTG|GTAGGGATGC...CTACTTGTGATT/TCAATTCTCACT...TCTAG|AAA 1 1 4.544
77101593 GT-AG 0 0.000587384179591 86 rna-XM_006414302.2 14424036 3 10768353 10768438 Eutrema salsugineum 72664 AAG|GTTTTTATTC...TTGTTCTTATTT/TTTGTTCTTATT...TGCAG|CAC 0 1 6.216
77101594 GT-AG 0 4.202127499826094e-05 81 rna-XM_006414302.2 14424036 4 10768552 10768632 Eutrema salsugineum 72664 CAG|GTTATCTTTT...CCTGTCTTAGTA/CTTTGCCTGATT...CACAG|TGG 2 1 8.27
77101595 GT-AG 0 1.000000099473604e-05 113 rna-XM_006414302.2 14424036 5 10768689 10768801 Eutrema salsugineum 72664 TTG|GTGAGGACTC...TTTTTTTTCACG/TTTTTTTTCACG...TGCAG|AAC 1 1 9.288
77101596 GT-AG 0 1.000000099473604e-05 102 rna-XM_006414302.2 14424036 6 10768900 10769001 Eutrema salsugineum 72664 GAT|GTAAGTCACT...TTGATTTTGATT/TTGATTTTGATT...TGCAG|GCT 0 1 11.069
77101597 GT-AG 0 3.096022367777146e-05 234 rna-XM_006414302.2 14424036 7 10769140 10769373 Eutrema salsugineum 72664 CCT|GTAATTGCCT...AGTTCTTTGGCC/TTTGTGGTAACT...ATCAG|GTG 0 1 13.577
77101598 GT-AG 0 1.000000099473604e-05 631 rna-XM_006414302.2 14424036 8 10769686 10770316 Eutrema salsugineum 72664 CAG|GTAGGATGTT...TATTCTTTGGAT/TATTGTCTAATA...TACAG|GAC 0 1 19.248
77101599 GT-AG 0 1.000000099473604e-05 358 rna-XM_006414302.2 14424036 9 10770602 10770959 Eutrema salsugineum 72664 TAT|GTAAGTGTAT...GGTAACTTGATC/CTTGATCTGATT...TCCAG|GCT 0 1 24.427
77101600 GT-AG 0 1.772617410657413e-05 85 rna-XM_006414302.2 14424036 10 10771122 10771206 Eutrema salsugineum 72664 GAG|GTAATTATAT...TGTTTCTTACTT/ATGTTTCTTACT...GCTAG|GCA 0 1 27.372
77101601 GT-AG 0 0.0002977160124731 80 rna-XM_006414302.2 14424036 11 10771399 10771478 Eutrema salsugineum 72664 CCA|GTAAGATTTA...TTCTCTTTGATG/TTCTCTTTGATG...TTCAG|GTA 0 1 30.862
77101602 GT-AG 0 0.939537760418664 282 rna-XM_006414302.2 14424036 12 10771523 10771804 Eutrema salsugineum 72664 CCA|GTATGTTTAT...AATTTCTTGATT/AATTTCTTGATT...TGCAG|CTC 2 1 31.661
77101603 GT-AG 0 0.0124141934466424 477 rna-XM_006414302.2 14424036 13 10771875 10772351 Eutrema salsugineum 72664 AAG|GTATGCTGTC...CAGTTTTTGAAC/CAGTTTTTGAAC...CACAG|GAG 0 1 32.933
77101604 GT-AG 0 1.784706418089734e-05 75 rna-XM_006414302.2 14424036 14 10772514 10772588 Eutrema salsugineum 72664 CAG|GTAGTCACGT...ATTTTTTTCACT/ATTTTTTTCACT...TCAAG|GCA 0 1 35.878
77101605 GT-AG 0 1.000000099473604e-05 137 rna-XM_006414302.2 14424036 15 10772697 10772833 Eutrema salsugineum 72664 CAG|GTTTGTCAAC...TTTGCCTTTCTG/TTTTATTTTACG...TTCAG|GTC 0 1 37.841
77101606 GT-AG 0 0.0157757501416402 340 rna-XM_006414302.2 14424036 16 10772872 10773211 Eutrema salsugineum 72664 TCG|GTATGCAATT...TTGTTCTTACTC/CTTGTTCTTACT...TTTAG|GGT 2 1 38.531
77101607 GT-AG 0 2.2120566619978732e-05 192 rna-XM_006414302.2 14424036 17 10773357 10773548 Eutrema salsugineum 72664 AAG|GTATAAAGTG...TATTTCTTTGTT/AATTTTCTAATT...TACAG|GCT 0 1 41.167
77101608 GT-AG 0 0.0005493979282468 557 rna-XM_006414302.2 14424036 18 10773676 10774232 Eutrema salsugineum 72664 CAG|GTACTTTTGG...TTTTTCTTTTCT/AACATGCTCATT...CATAG|GCG 1 1 43.475
77101609 GT-AG 0 0.0887690218388644 386 rna-XM_006414302.2 14424036 19 10774427 10774812 Eutrema salsugineum 72664 CTG|GTATGTTTTT...TTACTTTTGATT/TTACTTTTGATT...AACAG|GTA 0 1 47.001
77101610 GT-AG 0 0.0040046895089368 294 rna-XM_006414302.2 14424036 20 10774944 10775237 Eutrema salsugineum 72664 CAG|GTACTCTCCC...ATTTCCTTGCCT/CCTTGCCTAACT...TCTAG|GAA 2 1 49.382
77101611 GT-AG 0 0.0110064605864298 92 rna-XM_006414302.2 14424036 21 10775314 10775405 Eutrema salsugineum 72664 AAG|GTATTTTTGG...TATTCCTTTTTT/CAGAATTTTACA...TGCAG|GGT 0 1 50.763
77101612 GT-AG 0 0.000171358247682 330 rna-XM_006414302.2 14424036 22 10775517 10775846 Eutrema salsugineum 72664 CAG|GTATGTCATA...TATGCTTTGTTT/ATTCGCCTGACA...TGCAG|ATT 0 1 52.781
77101613 GT-AG 0 0.0234335762041499 288 rna-XM_006414302.2 14424036 23 10776030 10776317 Eutrema salsugineum 72664 GAG|GTTCCTTTCC...AATGTTTTGACT/AATGTTTTGACT...TTTAG|CAC 0 1 56.107
77101614 GT-AG 0 1.000000099473604e-05 193 rna-XM_006414302.2 14424036 24 10776465 10776657 Eutrema salsugineum 72664 AGG|GTGAGTTGCT...CTGATTTTAAGT/CTGATTTTAAGT...TGCAG|CTG 0 1 58.779
77101615 GT-AG 0 0.0001925018662203 95 rna-XM_006414302.2 14424036 25 10776832 10776926 Eutrema salsugineum 72664 CTG|GTATTGTCTT...ATTGCATTCACA/ATTCTACTAATT...ACCAG|GAA 0 1 61.941
77101616 GT-AG 0 1.000000099473604e-05 76 rna-XM_006414302.2 14424036 26 10777275 10777350 Eutrema salsugineum 72664 AAG|GTGAGACATT...ATTTCTTTGTTC/ATGAGTATAATT...TGCAG|AGC 0 1 68.266
77101617 GT-AG 0 0.0004124721210875 428 rna-XM_006414302.2 14424036 27 10777681 10778108 Eutrema salsugineum 72664 TTG|GTATGGTTTA...CATTCTGTACTT/AAAATTCTGATA...TACAG|GGA 0 1 74.264
77101618 GT-AG 0 1.000000099473604e-05 131 rna-XM_006414302.2 14424036 28 10778295 10778425 Eutrema salsugineum 72664 CAG|GTAATATTTT...TGGTTTTTGCCT/TAATAACTTATG...TGTAG|GCC 0 1 77.644
77101619 GT-AG 0 1.000000099473604e-05 92 rna-XM_006414302.2 14424036 29 10779401 10779492 Eutrema salsugineum 72664 CAG|GTGGGCAATT...GTTTTTGTAAAA/GAAATACTCATA...TGCAG|GTT 0 1 95.365
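
In the rows above, `length` appears to equal `end - start + 1`, which would mean `start` and `end` are both included in the intron. This is an inference from the displayed values, not something the source states. A small sketch that checks the relationship on four rows copied from the table (`inclusive_length` is an illustrative helper name):

```python
# (start, end, length) copied from four of the rows above.
rows = [
    (10767939, 10768016, 78),
    (10768180, 10768260, 81),
    (10769686, 10770316, 631),
    (10779401, 10779492, 92),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end positions are both included."""
    return end - start + 1


# Collect any sampled row whose stored length disagrees with the coordinates.
mismatches = [r for r in rows if inclusive_length(r[0], r[1]) != r[2]]
print(mismatches)  # an empty list means every sampled row is consistent
```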

CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
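
The index on `transcript_id` is what lets a filter such as `transcript_id = 14424036` avoid a full table scan. The sketch below rebuilds a reduced version of the schema in an in-memory SQLite database (only three columns, no foreign keys, no data) and asks SQLite for its query plan. The reduced table is an assumption made for illustration; the exact wording of the plan text varies between SQLite versions, so the check only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative version of the schema: three columns and one index.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "score" REAL,
      "transcript_id" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (14424036,),
).fetchall()

# The last field of each plan row is a human-readable description of the step.
details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
```

If `uses_index` is true, SQLite plans to look rows up through the index rather than scanning the table. The same check can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`), although the planner is free to ignore an index when it judges a scan cheaper.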