Dear Maximilian,
Thank you for your interest in hosting UniversalEPI’s predictions on UCSC Genome Browser. We have exported the model predictions corresponding to all bulk ATAC-seq data from ENCODE (consisting of 157 cell lines and primary cells).
The hub can be found here: https://boevalab.inf.ethz.ch/resources/universalepi_pred_encode/ucsc_track_hub/hub.txt
Information on the input ATAC-seq tracks and the generated Hi-C can be found here: https://boevalab.inf.ethz.ch/resources/universalepi_pred_encode/
It would be wonderful if this hub can be to the UCSC’s list of public track hubs. Please let us know if there are any questions or concerns.
Best regards,
Aayush Grover
Computational Cancer Genomics Lab, ETH Zürich
Institute for Machine Learning, Department of Computer Science
CAB G 31.1
Universitätstrasse 6
8092 Zürich, Switzerland
From:
Boeva Valentina <valenti...@inf.ethz.ch>
Date: Friday, 22 November 2024 at 10:22
To: Maximilian Haeussler <mhae...@ucsc.edu>
Cc: Grover Aayush <aayush...@inf.ethz.ch>, Krymova Ekaterina <ekaterin...@sdsc.ethz.ch>, Rabuzin Lovro <lovro....@inf.ethz.ch>
Subject: RE: Track hubs
Dear Max,
We are interested😊We should have the final model in the coming weeks.
Maybe we could also have a track with predicted hi-C_QTL (if we get enough computing power) ;-) We’ll think about it!
Best wishes
Valentina
From: Maximilian Haeussler <mhae...@ucsc.edu>
Sent: Thursday, November 21, 2024 22:32
To: Boeva Valentina <valenti...@inf.ethz.ch>
Cc: Grover Aayush <aayush...@inf.ethz.ch>; Krymova Ekaterina <ekaterin...@sdsc.ethz.ch>
Subject: Re: Track hubs
Yes, or the HTAN cell lines. They’ll have a lot of hic to train on.
Sharing it via UCSC should be as easy as putting all files into a directory and running a single command. Let me know if you're interested.
On Thu, Nov 21, 2024 at 6:54 AM Boeva Valentina <valenti...@inf.ethz.ch> wrote:
Dear Max,
Thank you for your email! Indeed, we could potentially predict Hi-C for all ENCODE cell lines and share the predictions via UCSC! Great idea!
Will be in touch!
Valentina
From: Maximilian Haeussler <mhae...@ucsc.edu>
Sent: Friday, November 15, 2024 10:49
To: Boeva Valentina <valenti...@inf.ethz.ch>
Subject: Track hubs
Hi Valentina,
thanks for your fascinating talk today at CSHL GI. I work for the UCSC Genome Browser and would love to look at your predictions, once they're published.
To make that easier, it would be great if your group could make the data available in a format that can be easily browsed. Your students can make bigWig or bigBed file from your data, maybe even in bigInteract format see https://genome.ucsc.edu/goldenPath/help/bigBed.html#Ex3, they can put that into a track hub, just a little text file, https://genome.ucsc.edu/goldenpath/help/hubQuickStart.html and send its URL to gen...@soe.ucsc.edu so we can add it to our list here https://genome.ucsc.edu/cgi-bin/hgHubConnect The hub in this format can also be loaded into IGV, Ensembl, NCBI, especially IGV, so anyone who has the link or is on the track hub list, can access your tracks with a single click.
If there are any problems with data hosting or format conversions or any other questions, let me know, ideally also CC gen...@soe.ucsc.edu.
I'll try to find you later, but in case you're busy, at least I sent you this email :-)
Max
Hello,
Thank you for sharing your UniversalEPI predictions hub and submitting a request to add it as a public track hub.
We have added this track hub to the queue. From an initial look, it appears that the UniversalEPI predictions hub is missing the following, which are required for public hubs:
You can read more about the required and recommended guidelines on the following help page:
https://genome.ucsc.edu/goldenPath/help/publicHubGuidelines.html
Let us know if you have any questions regarding your hub or track hub settings, and when you would like us to take another look at the hub.
Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/6BE8BFBC-7BEC-4A71-A4A0-99A40EF73C8F%40ethz.ch.
Hi Gerardo,
Thank you for pointing us to the missing URLs and sorry for missing them. The track hub is now updated with these details (https://boevalab.inf.ethz.ch/resources/universalepi_pred_encode/ucsc_track_hub/hub.txt). Please let us know if anything else needs to be added.
Many thanks,
Aayush
From:
Gerardo Perez <gpe...@ucsc.edu>
Date: Tuesday, 1 July 2025 at 21:58
To: Grover Aayush <agr...@ethz.ch>
Cc: Valentina Boeva <valenti...@inf.ethz.ch>, "gen...@soe.ucsc.edu" <gen...@soe.ucsc.edu>
Subject: Re: [genome] Re: Track hubs
Hello,
Thank you for sharing your UniversalEPI predictions hub and submitting a request to add it as a public track hub.
We have added this track hub to the queue. From an initial look, it appears that the UniversalEPI predictions hub is missing the following, which are required for public hubs:
Hello,
Thank you for updating the track hub.
We took a look at the hub and noticed that the Hi-C Predictions tracks were built using the bigInteract track format. Did you consider using the hic track format to build these tracks?
We also noticed that some of the tracks in the Hi-C Predictions composite track have “raw” in their label, such as the A549 raw track. Were these tracks processed using the log-transformed ICE-normalized Hi-C format?
This is not required, but you may be interested to know that you can also build tracks using the dimensions setting in the trackDb file. This setting allows for a two-dimensional array of checkboxes based on groups in the track configuration interface:
In the trackDb file, you could add
subGroup2
and dimensions
settings to the universalepi_hic composite track, such as the following:
Then, for each subtrack of the universalepi_hic composite track, you can include the cellType in the subGroups
line, for example:
Here is a snippet of the trackDb file with these settings:
https://hgwdev.gi.ucsc.edu/~gperez2/mlq/hub_35990/UniversalEPI/hg38/trackDb_edit.txt
Here is a session with the UniversalEPI ENCODE hub attached, and these trackDb edits applied:
https://genome.ucsc.edu/s/gperez2/UniversalEPI_dimensions
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute
Dear Gerardo,
Thank you for the suggestions.
The hub is now modified with these changes (https://boevalab.inf.ethz.ch/resources/universalepi_pred_encode/ucsc_track_hub/hub.txt). Looking forward to your comments.
Many thanks,
Aayush
From:
Gerardo Perez <gpe...@ucsc.edu>
Date: Wednesday, 16 July 2025 at 02:09
To: Grover Aayush <agr...@ethz.ch>
Cc: Valentina Boeva <valenti...@inf.ethz.ch>, "gen...@soe.ucsc.edu" <gen...@soe.ucsc.edu>
Subject: Re: [genome] Re: Track hubs
Hello,
Thank you for updating the track hub.
We took a look at the hub and noticed that the Hi-C Predictions tracks were built using the bigInteract track format. Did you consider using the hic track format to build these tracks?
We also noticed that some of the tracks in the Hi-C Predictions composite track have “raw” in their label, such as the A549 raw track. Were these tracks processed using the log-transformed ICE-normalized Hi-C format?
This is not required, but you may be interested to know that you can also build tracks using the dimensions setting in the trackDb file. This setting allows for a two-dimensional array of checkboxes based on groups in the track configuration interface:
Hello, Aayush.
Thanks for the explanation. The current display feels very dense, with too many connections. It might help to apply stronger filters, use the hic format, or try a filtered version with hic. Let us know if you would like to explore that option.
We did notice that some tracks share the same longLabel, which can be confusing when displayed in the browser graphic.
For example, the a549_rgs and a549_asc tracks have the same longLabel:
You can add more detail to the longLabel so that the two tracks are easier to distinguish, for example:
longLabel A549 rgs raw
longLabel A549 asc raw
We have added the hub to our list of Public Hubs: https://genome.ucsc.edu/cgi-bin/hgHubConnect?#publicHubs
We traditionally announce new public hubs. Below is a draft announcement, along with a condensed version for Bluesky/X. Let us know if any information seems inaccurate, if you have any tags you would like us to include, or if the credits statement properly acknowledges the relevant group(s).
Full announcement draft:
We are pleased to announce the release of a new Public Hub, UniversalEPI ENCODE for hg38. This track hub displays normalized ATAC-seq and Hi-C interaction predictions based on ENCODE data and generated using UniversalEPI. UniversalEPI is an attention-based deep ensemble model that predicts enhancer–promoter interactions (EPIs) up to 2 Mb apart using only DNA sequence and chromatin accessibility (ATAC-seq) data. The hub contains two composite tracks:
We would like to thank the Boeva Lab at ETH Zurich for creating this hub.
Social media condensed version draft :
We are pleased to announce our latest public hub, UniversalEPI ENCODE for hg38. It shows ATAC-seq and Hi-C interaction predictions based on ENCODE data, generated by UniversalEPI.
Thanks to the Boeva Lab at ETH Zurich for creating this hub.
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute
Dear Gerardo,
Thank you for the suggestions. Glad to see that the UniversalEPI predictions are now available as a public hub.
I agree that the current visualization looks very dense. Is it possible to configure the current bigInteract files in trackdb such that they show interactions with “both ends in window” by default? In this case, it looks somewhat better (see below). Otherwise, is it possible to apply a stronger filter by default in trackdb without changing the input file i.e. we can show all interactions greater than score of 200 by default but the user can choose to set the threshold to 0 in the track settings if needed? If these cannot be applied to bigInteract, we will be happy to consider converting the files to hic.
We have also updated the longLabel for different experiments of the same cell lines, as suggested.
Finally, announcement drafts read well! In the condensed version, it might seem like UniversalEPI predicts both Hi-C and ATAC-seq, which might be misleading so I would suggest the following for the social media condensed version:
“We are pleased to announce our latest public hub, UniversalEPI ENCODE for hg38. It shows Hi-C interaction predictions based on ENCODE ATAC-seq data, generated by UniversalEPI.
Thanks to the Boeva Lab at ETH Zurich for creating this hub.”
Everything else looks great to me! Once again, thank you for the support in making our predictions publicly available.
Many thanks,
Aayush
From:
Gerardo Perez <gpe...@ucsc.edu>
Date: Tuesday, 29 July 2025 at 18:15
To: Grover Aayush <agr...@ethz.ch>
Cc: Valentina Boeva <valenti...@inf.ethz.ch>, "gen...@soe.ucsc.edu" <gen...@soe.ucsc.edu>
Subject: Re: [genome] Re: Track hubs
Hello, Aayush.
Thanks for the explanation. The current display feels very dense, with too many connections. It might help to apply stronger filters, use the hic format, or try a filtered version with hic. Let us know if you would like to explore that option.
We did notice that some tracks share the same longLabel, which can be confusing when displayed in the browser graphic.
Hello, Aayush.
Yes, you can add the endsVisible two setting to the trackDb.txt file to enable the “both ends in window” display by default. To apply a stronger filter, you can use the scoreFilter setting, for example, adding scoreFilter 200
will display only interactions with a score greater than 200 by default. Users can still adjust this threshold in the track settings if needed.
Thank you also for the feedback on the announcement. We will go ahead and use your suggested condensed version for the social media post:
We are pleased to announce our latest public hub, UniversalEPI ENCODE for hg38. It shows Hi-C interaction predictions based on ENCODE ATAC-seq data, generated by UniversalEPI: link to hub
Thanks to the Boeva Lab at ETH Zurich for creating this hub.
We are planning to send out the release announcement tomorrow (08/05/2025).
We appreciate the refinements you have made to the hub and your efforts to make it publicly available.
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on a publicly-accessible Google Groups forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
Gerardo Perez
UCSC Genomics Institute
Hi Gerardo,
Thank you for the suggestions. I have now modified the trackDb.txt with “endsVisible two” and “scoreFilter 200”.
Looking forward to announcement tomorrow.
In the trackDb file, you could add
subGroup2
anddimensions
settings to the universalepi_hic composite track, such as the following:
Hi Gerardo,
Thank you for the suggestions. I have now modified the trackDb.txt with “endsVisible two” and “scoreFilter 200”.
Looking forward to announcement tomorrow.
Date: Tuesday, 5 August 2025 at 22:53
In the trackDb file, you could add
subGroup2
anddimensions
settings to the universalepi_hic composite track, such as the following: