AnVIL Demos: Open Discussion Forum on April 15, 2026

Q: We’re organizing communication with a collaborating group. They are also interested to use big data in the cloud platform, interested to use All of Us. Interested to introduce the AnVIL platform to this collaboration. Would appreciate suggestions for how to introduce a potential new user. What material is the best to share with them?

A: Is the intent of this group and collaboration to analyze with All of Us data?

Q: In our experience, we’ve been using AnVIL platform to analyze our own data. We’re interested to merge it with the All of Us cohort to make a larger one. There are scientists managing a large cohort at their institution. They’d like to combine the All of Us cohort with their own dataset as a use case. We’d like to introduce them based on our good experience with AnVIL and they can combine with All of Us later.

A: The place we start often is the AnVIL Getting Started Guide: Getting Started on AnVIL . This includes many of the logistics and details needed for users to learn about how to get set up on AnVIL.

We’d also encourage you to share the monthly AnVIL Demos where people can attend virtually and talk with real people about their questions and research plans: About the AnVIL Demos category .

There is also the AnVIL Support Forum (help.anvilproject.org), where we’d like to encourage them to post any questions they have any time. We can respond in a timely way and connect with other AnVIL team members to find answers to their questions asynchronously.

Q: Thank you, we will encourage them to review these materials.

A: One of the places we can encourage users to start as early if they’re wanting to get on AnVIL is to start setting up a Google Billing Account at their institution. People typically work with NIH STRIDES which can provide discounts on cloud computing costs for extramural and intramural NIH investigators. This does take a bit of time and paperwork to set up, so we encourage people to start early.

Q: We have worked with ODSS to set up billing accounts for AnVIL and for All of Us. These are fairly similar since they are both at GCP. Thank you.

--

Q: I will upload some data. I want to know how your staff check the data and how to avoid correct the data and to upload it again. We are part of a consortium. NIH requires certain data to be uploaded to be in compliance of the NIH Data Management and Sharing Plan. We already uploaded once and will upload a second time. We want to make sure it meets the requirements so that we to this correctly.

A: There are two general categories of data that are submitted: data that are generated and funded by large NHGRI consortia for which data storage be supported by the AnVIL Team, and data that are generated by grantees that need to deposit to be in compliance with the NIH Data Sharing Policy where storage would not be supported by the AnVIL Team.

I would recommend reviewing the materials here: Submitting Data - AnVIL Portal , particularly Step 1 - Register Study / Obtain Approvals: Step 1 - Register Study / Obtain Approvals - AnVIL Portal .

A: AnVIL has a rigorous protocol for QC and review of data. This is most applied to data for AnVIL. You can create your own workspaces and Google buckets, bring your data in, store it, and share it more broadly.

Q: We are submitting not genomic data, but clinical data.