I have data in one of my workspaces that I’d like to make publicly-accessible, for API access for users outside of AnVIL. The data is summary-level variant interpretation evidence, to be integrated into ClinGen. I could download the data and set up an API endpoint elsewhere, but it would be nice to do this within AnVIL. How are other people approaching the dissemination of their derived data? I’ve looked into the following options so far, and they have their pros and cons:
- Share the data on Terra as a featured Terra workspace. With the right settings, this workspace can be accessible by API to users outside Terra.
Pro: this can be set up quickly.
Pro: it can in principle be done today, without requiring any platform changes.
Con: there’s no way to get around the egress charges. In my case, the egress charges should be modest because the data is so small, but it requires the long-term commitment to cover those charges. I don’t want to use requestor-pays, because it doesn’t make sense in my use case (for reasons I won’t go into here for brevity).
- Public the data on Gen3
- Con: these data don’t seem to fit the Gen3 schema: there’s no sequencing information, and while there is relevant phenotype information, there’s no patient information. It seems like this just breaks the Gen3 model.
Thanks in advance!