Skip to main content
Mosaic Biodata

Bring Your Own Data

Already sequenced? Skip the kit. Run the reports.

If you already have your DNA file, Mosaic can build our reports from it. No new kit. Create a free account, upload your data, and run the report inside the platform.

Registration and upload are free. No payment until you decide to run a report.

How it works

Three steps. No surprises.

  1. Step 01

    Create a free account

    Registration and upload are free. No payment until you decide to run a report.

    Create your free account →
  2. Step 02

    Upload your data securely

    HIPAA-compliant pipeline. The file picker accepts .txt, .csv, .vcf, .vcf.gz, .gz up to 300 MB. We confirm the file is processable before it is uploaded.

  3. Step 03

    Purchase your report in the platform

    Once your data clears the quality check, the Mosaic reports are available to purchase and run. Report purchases happen inside the platform — reports are attached to your uploaded data file.

What we accept

The files Mosaic can read.

Three categories cover almost everything: consumer raw-genotype files, sequencing-service exports, and whole-genome VCFs. Plus a fallback parser for rsID-keyed files from smaller vendors we don’t explicitly list.

Consumer raw genotype

TXT / CSV

  • 23andMe
  • AncestryDNA
  • MyHeritage
  • FamilyTreeDNA
  • LivingDNA
  • tellmeGen
  • HomeDNA
  • WeGene

These are SNP-chip files. They cover a subset of the genome (≈600,000 positions) — Mosaic builds the report from whatever the chip captured and flags any insights gated by coverage. See the coverage note below.

Sequencing-service genotype

TXT

  • Sequencing.com (Ultimate Compatibility export)
  • Illumina GSGT clinical / pharmacogenomic arrays (TruV3, GSAv3)

Exports from intermediate platforms that have already processed your raw data into rsID-keyed genotype calls.

Whole-genome sequencing

VCF / VCF.gz

  • Sequencing.com WGS
  • Nebula Genomics
  • Dante Labs
  • Veritas Genetics
  • BGI
  • Illumina clinical lab

Single-sample, snp-indel variant calls from any clinical-grade WGS provider. Generic .vcf or gzipped .vcf.gz both work.

We also have a fallback parser for rsID-keyed TXT from smaller vendors we don’t explicitly list (Vitagene, Genos, others). Not sure if yours qualifies? Tell us about your file and we’ll confirm before you upload.

The fine print

Limits + how we handle the gaps.

Files we can’t accept (yet)

So you don’t waste time uploading the wrong thing.

  • Structural-variant VCFs

    Files ending in .sv.vcf.gz (e.g. Manta output) describe large rearrangements, not the SNP/indel calls Mosaic reports use. Look for your "snp-indel" companion file from the same vendor.

  • Multi-sample VCFs

    Files containing more than one person’s data. Mosaic processes single-sample uploads only.

  • Clinical pharmacogenomic gene-CSVs

    Files structured as Gene,Variant,Result (some clinical labs export this way). Mosaic needs rsID-keyed raw genotype data, not interpretation summaries.

  • JSON, XML, or HTML files

    These are interpretation outputs, not raw data. The underlying source file is what we need.

  • Gzipped non-VCF files

    A .txt.gz or .csv.gz needs to be decompressed first, then re-uploaded as .txt or .csv. (VCFs in .vcf.gz are fine, that’s the standard.)

  • Empty, corrupt, or oversized files

    Anything that fails integrity check, anything over 300 MB. The 300 MB cap is Phase 1; larger files may be supported later.

How we handle gaps in the data

We don’t impute from reference panels.

Some platforms use a method called imputation — they statistically guess at positions they never actually read, based on patterns in reference populations. We don’t. If your data doesn’t cover a position, the report tells you. The blanks stay blank.

The one technical exception: when a VCF file is set up to list only positions that differ from the reference (a standard file-format choice that saves space), we read the absences as matches. That’s reading the file format correctly, not filling in statistical guesses.

Pricing

BYOD advantage.

Lifestyle Optimization Report

$199

per qualifying upload

Any qualifying upload — whole-genome VCF or SNP-chip file. Every future Mosaic report runs on the same uploaded data. Buy once per report, not once per sequencing.

Provider pricing is shown inside an approved provider account and is not displayed on the public site.

Coverage note for SNP-chip uploads

SNP-chip files from 23andMe, AncestryDNA, MyHeritage, and similar consumer tests cover a tiny fraction of your DNA — roughly 600,000 positions out of 3 billion base pairs. Mosaic builds the most complete Lifestyle Optimization Report your data can support, but some insights won’t be available because the chip didn’t capture the relevant positions. Your report tells you which insights are gated and why.

Want everything the report can give you?

A Mosaic kit reads all of your DNA. Once, forever.

Whole-genome sequencing at 30× clinical-grade depth. Every chromosome, every gene, every variant — captured once, used forever. Every future Mosaic report runs on this data. Any other platform that reads standard genomic files does too. You own the underlying record forever.

Optional: confirm first

Not sure if your data qualifies?

Tell us the vendor, year, format, and depth (if known). We respond within one business day. The default path is “register and upload.” This form is for cautious users or borderline data.

Where the sequencing was done.

Whole-genome at 30× is the bar. Higher is fine.

Tell us what you’ve got; we’ll confirm what we can use.

Don’t attach the data yet — we’ll set up a secure transfer once we confirm eligibility.

We’ll only use this information to evaluate eligibility and pricing for your Mosaic report. See our privacy policy.

Your Genome. Your Rules.

  • Lifetime data ownership

    You own 100% of your raw genome, forever. Downloadable in standard formats, readable by any analysis tool, including every Mosaic report current and future.

  • No selling, sharing, or monetizing

    We never sell, share, license, or monetize your genome — full stop. Pseudonymized, encrypted, HIPAA-compliant pipeline end to end.

  • Clinical-grade lab

    30× whole-genome sequencing at a CLIA-, COLA-, CAP-, and AABB-accredited U.S. lab. Real clinical infrastructure, not consumer-grade genotyping.

  • Built inside clinical practice

    Refined across 10+ years of human-performance and longevity work at Apeiron Center. Defensible in a clinical setting; usable in real life.

  • Lab-failure protection

    If the lab can’t process your sample, the first reprocessing is free. If a second sample also fails, we refund your kit price minus a $20 processing fee.

That’s the Mosaic standard. No exceptions. No fine print.

Your data. A real report. Today.

No new kit. No waiting. Most uploads clear the quality check in minutes; reports render in your account from there.