17b7d3c37be41135afaf8e91e365e3847af96ca5 lrnassar Mon Jun 22 10:56:56 2026 -0700 Add TAD (topologically associating domains) track set on hg19, hg38, mm10, mm39. refs #21599 New "tads" superTrack collecting published TAD calls, alpha-gated via include tad.ra alpha in each assembly's trackDb.ra. hg38 (all five sources): Dixon 2012 domains, Schmitt 2016 boundaries, McArthur & Capra 2021 boundary stability, ENCODE contact domains (faceted composite over 117 biosamples), and 3D Genome Browser 2.0 domains (faceted composite over 464 datasets). hg19: the three sources with hg19-compatible data (Dixon, Schmitt, McArthur). mm10/mm39 (domains only; the boundary sources have no mouse data): Dixon, ENCODE (faceted, 16 biosamples), and 3D Genome Browser (faceted, 30 datasets); mm39 lifted from mm10, lift noted in the long labels. Faceted composites are organ-colored from a TAD-owned organ_colors.json symlinked into /gbdb//bbi/tad/. Build scripts and autoSql are version-controlled under makeDb/scripts/tad/ and symlinked into the per-source build dirs. Provenance and fetch for every dataset are documented in the makedocs (doc/hg38/tad.txt, doc/mm10/tad.txt, doc/mm39/tad.txt, and the hg19 TAD section in doc/hg19.txt). diff --git src/hg/makeDb/trackDb/human/hg38/tads3dgb.html src/hg/makeDb/trackDb/human/hg38/tads3dgb.html new file mode 100644 index 00000000000..e5569ed45bd --- /dev/null +++ src/hg/makeDb/trackDb/human/hg38/tads3dgb.html @@ -0,0 +1,127 @@ +

Description

+

+This composite shows TAD domains from the +3D Genome Browser +(3DGB) across 464 human Hi-C and Micro-C datasets on hg38. Each subtrack is one +3DGB dataset, displayed exactly as called and published by 3DGB. TAD domains are +megabase-scale regions of the genome that preferentially self-interact; their boundaries +(frequently bound by CTCF and cohesin) insulate neighboring regions and constrain +enhancer-promoter contacts. +

+

+The 464 datasets span a wide range of normal and cancer samples, baseline and +perturbation conditions, organs, and cell types, drawn from many published studies and +re-processed by 3DGB through a single TAD-calling pipeline. They are browsable with a +faceted selector (see below); the displayed domain intervals are 3DGB's own, with +no UCSC re-calling, merging, lifting, or recurrence scoring. +

+ +

Display Conventions and Configuration

+

+Each subtrack is drawn as boxes spanning the self-interacting domains and is +colored by organ. By default a small set of canonical reference datasets is shown +(GM12878, H1-ESC, IMR-90, and HMEC); all other datasets are turned off and can be enabled +through the faceted selector. Mousing over a domain shows the dataset name, organ, and +assay. +

+

+These 464 datasets are not a cross-comparable consensus. Each represents one +dataset's own TAD calls, made by different laboratories on different samples; coordinates +are therefore not directly comparable across subtracks, and they are not directly +comparable to the other TAD tracks in this set (which use different callers and +resolutions). Because calls are made on binned Hi-C data (3DGB calls TADs at 25 kb), +domain edges are uncertain to roughly the bin size, and domains do not tile the genome end +to end. +

+ +

Faceted selector

+

+Use the faceted selector on the track configuration page to choose which datasets to +display. Datasets can be filtered by: +

+ + +

The "Provenance" facet

+

+Because this track ships every 3DGB human dataset, a few of the underlying source studies +are already represented elsewhere in the UCSC Genome Browser. The Provenance facet +flags these so they can be identified or filtered out: +

+ +

+To view only datasets that are new to the browser, select "Novel to browser" in +the Provenance facet. Note that even the "Also in another UCSC track" datasets +may differ in their displayed coordinates from the other UCSC tracks, because 3DGB +re-processed and re-called each study through its own pipeline. +

+ +

Methods

+

+TAD domains were called by the 3D Genome Browser pipeline and are displayed verbatim. +UCSC performed only a format normalization: each 3DGB per-dataset TAD file (a BED-like +file with placeholder columns and an alternating two-color shading that carries no +biological meaning) was reshaped to a plain four-column bigBed (chromosome, start, end, +dataset name) and indexed. No domain coordinates were changed, and no re-calling, merging, +lifting (all datasets are native hg38), or recurrence scoring was performed. The dataset +metadata used to drive the faceted selector (organ, cell type, assay, year, study) was +copied directly from the 3D Genome Browser. +

+ +

Data Access

+

+The raw data can be explored interactively with the +Table Browser or the +Data Integrator. For programmatic access, the +track can be accessed using the Genome Browser's +REST API. +The underlying bigBed files can be downloaded from our +download server. +The complete original datasets are available from the +3D Genome Browser. +

+ +

Credits

+

+Thanks to the 3D Genome Browser team (Yue lab, Northwestern University) for assembling and +uniformly processing these datasets. The 3D Genome Browser data are distributed under a +CC BY-NC 4.0 +license (free for non-commercial use). Please cite the 3D Genome Browser, and the original +studies, when using these data. +

+ +

References

+

+Yu S, Fu Y, Wong JH, Wang J, Zhao H, Zhao J, Yue F. +The 3D Genome Browser 2.0: an enhanced online platform for visualizing and analyzing 3D +genome architecture. Nucleic Acids Res. 2026;54(D1):D48-D54. +doi:10.1093/nar/gkaf1109 +

+

+Wang Y, Song F, Zhang B, Zhang L, et al. +The 3D Genome Browser: a web-based browser for visualizing 3D genome organization and +long-range chromatin interactions. Genome Biol. 2018;19(1):151. +doi:10.1186/s13059-018-1519-9 +