NFDI4Microbiota FlexFund 2026

BioXend
Microbial Biotransformation of Xenobiotics

A community-driven computational framework for standardizing, collecting, and submitting microbial biotransformation data to ChEMBL — making xenobiotic metabolism research FAIR and reproducible.

View Standards Read the Docs View on GitHub
version MIT License PRs welcome Bioschemas ChEMBL

Four work packages/ components

Microbial biotransformation of xenobiotics — the process by which microorganisms chemically modify drugs, environmental contaminants, and other foreign compounds — is poorly represented in public databases due to a lack of reporting standards. BioXend fixes that.


Work Packages

WP1

Minimum Standards

Develop community-consensus minimum reporting standards (MIX-MB) covering chemicals, microbes, and biotransformation assays.

WP2

Metadata Automation

Automate metadata collection from user inputs, external APIs (ChEMBL, NCBI Taxonomy), and Bioschemas-aligned identifiers.

WP3

ChEMBL Submission Workflow

Nextflow DSL2 pipeline that generates seven ChEMBL-ready submission files from simple tabular inputs..

WP4

NFDI4Microbiota Integration

Release BioXend as part of the NFDI4Microbiota workflow ecosystem for broad community adoption and long-term maintenance.

Minimum Information about Xenobiotics-Microbiome Biotransformation

Three interconnected sub-standards define exactly what metadata must be reported for reproducible and database-ready experiments. The standards are continuously undergoing updates and are versioned. We appreciate your feedback.

MIX-MB Overview

Standards Main Document

Top-level overview of all three sub-standards, version matrix, identifier scheme, ChEMBL field mappings, and FAIR alignment. Start here to understand the full MIX-MB framework.

View MIX-MB Standards overview →
MIX-MB(X)

Xenobiotics / Chemicals

Minimum metadata for the chemical substrate of biotransformation reactions: structure, identity, and classification.

  • Comp_ID Compound index identifier
  • SMILES Valid canonical SMILES or InChIKey
  • COMPOUND_NAME Preferred chemical name
  • ChEBI / ChEMBL ID Cross-reference identifiers
View MIX-MB(X) standard →
MIX-MB(M)

Microbes / Organisms

Minimum metadata for the microbial agent: taxonomy, culture conditions, and sample provenance — aligned to Bioschemas Taxon profile.

  • NCBI TaxID Primary taxonomy identifier
  • ASSAY_ORGANISM Organism name (genus, species)
  • O2_CONDITION Oxygen requirement (controlled vocab)
  • SAMPLE_TYPE Bacterium, community, enzyme
  • ENA / BioSample ID Sequencing data links
View MIX-MB(M) standard →
MIX-MB(B)

Biotransformation / Assay

Minimum metadata for the experimental assay and activity: conditions, analytical methods, products, and activity measurements.

  • Assay_ID Assay index identifier
  • ASSAY_TYPE bacteria / enzyme / community
  • ACTION_TYPE Substrate / Product / No Activity
  • TEXT_VALUE Measured activity value
  • ASSAY_DESCRIPTION Free-text assay description
View MIX-MB(B) standard →

Submission Template

Template_open.ods — 5 sheets covering all three standards

Download Template_open.ods ↓

5 template sheets

Reference · Chemical · Microbe · Experiment · Activity

Color coding

Green = Mandatory   Blue = Recommended   Yellow = Optional


Help shape the standards

MIX-MB is community-driven. Your expertise in microbiology, metabolomics, cheminformatics, or data science directly shapes what gets standardized. Every form of participation counts.

Survey

Fill out the Survey

No GitHub account needed. Share your opinion on which fields are truly essential for biotransformation data.

Open Survey →
Issue

Open an Issue

Propose a new field, report a bug, or flag missing vocabulary terms via GitHub Issues — templates provided.

Open an Issue →
Discussion

Join the Discussion

Have a broader idea or question? GitHub Discussions is the place for open-ended proposals and community debate.

Go to Discussions →
Endorse

Endorse Proposals

Browse the Project Board and add a thumbs-up to issues and PRs you support. Community endorsement drives the standards forward.

View Project Board →
PR

Submit a Pull Request

Comfortable with Git? Submit PRs against the devel branch. Standards changes require a 14-day review period.

Read Contributing Guide →
Analysis

View Survey Results

Explore community survey results and analysis to see what the field considers most important to standardize.

Browse Analysis →

Automated ChEMBL Submission Workflow

A Nextflow pipeline that generates seven ChEMBL-ready submission files from a single filled-out Template_open.ods.


Pipeline Steps

1

Reference

Reads publication metadata from the Reference sheet

REFERENCE.tsv README.toml
2

Chemicals

Validates SMILES and generates 2D structures via RDKit

COMPOUND_RECORD.tsv COMPOUND_CTAB.sdf
3

Microbes

Builds assay entries per organism and condition

ASSAY.tsv
4

Experiment

Links experimental parameters to each assay by AIDX

ASSAY_PARAM.tsv
5

Biotransformation

Links compounds to assays and writes activity records

ACTIVITY.tsv

Quickstart

Requires Nextflow ≥ 22.10.0 and Docker.

# 1. Build the Docker image (once) $ docker build -t bioxend:latest . # 2. Run the pipeline $ nextflow run main.nf -profile docker --input Standards/Templates/Template_open.ods --outdir results/ --prefix HMDM --xenobiotic_class drug
Full Usage Docs → Output Reference →

Who is behind BioXend?

MZ

Mahnoor Zulfiqar

Maintainer · EMBL, Molecular Systems Biology Unit

ORCID: 0000-0002-8330-4071 ↗
EM

Eleonora Mastrorilli

Maintainer · EMBL, Molecular Systems Biology Unit

ORCID: 0000-0003-2127-4150 ↗
ZG

Zimmermann Group

Host Lab · EMBL, Molecular Systems Biology Unit

embl.org/groups/zimmermann ↗
CBG

Chemical Biology Group

Collaborator · European Bioinformatics Institute (EBI)

ebi.ac.uk/about/teams/chemical-biology-services/ ↗
NFDI4Microbiota

BioXend is funded by the NFDI4Microbiota FlexFund 2026. NFDI4Microbiota is part of Germany's National Research Data Infrastructure (NFDI) initiative, dedicated to making microbiome research data FAIR and accessible.