Report Catalog¶
Complete index of all reports available in Biofilter 4. Each report has a name (used in CLI and Python API), a brief description, and links to its explain guide and interactive notebook tutorial where available.
For general usage — how to run, list, and introspect reports — see Reports.
Running any report¶
# CLI
biofilter report run --report-name <name> [--param KEY=VALUE ...] [--output file.csv]
biofilter report explain --report-name <name>
biofilter report run --report-name <name> --params-template
# Python API
df = bf.report.run("<name>", param1=value1, param2=value2)
ETL & Platform Monitoring¶
Reports for inspecting the state of the ETL pipeline and the knowledge base.
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
Current status of all ETL packages (active, last run, row counts) |
||
|
Full provenance log of all ETL executions with timestamps and file hashes |
||
|
Row counts and coverage metrics across all master tables |
||
|
PostgreSQL table sizes, row estimates, and bloat metrics (PostgreSQL only) |
||
|
PostgreSQL index usage, size, and scan counts (PostgreSQL only) |
Entity & Relationship¶
Reports for exploring the biological entity graph.
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
Filter and list entities (genes, pathways, diseases, …) by type, source, or name pattern |
||
|
Retrieve all entities related to an input list through shared biological groups (pathways, diseases, GO, PPI) |
— |
|
|
Resolve heterogeneous inputs (gene:, disease:, pathway:, …) and return a 1-hop neighborhood summary grouped by neighbor type |
Annotation Masters¶
Reference tables exposing the full content of each biological domain in the knowledge base. Useful for exploring available terms before using them as filters in other reports.
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
All genes with HGNC symbol, Ensembl ID, locus, and source provenance |
||
|
All pathways across all source systems (Reactome, KEGG, …) |
||
|
All proteins with UniProt IDs and gene mappings |
||
|
All diseases with MONDO/ClinGen IDs and gene associations |
||
|
All Gene Ontology terms (BP, MF, CC) with gene memberships |
||
|
All chemical compounds (ChEBI) with gene and pathway associations |
||
|
Full annotation for input variants: frequencies, pathogenicity scores, VEP consequences per transcript, AlphaMissense |
Variant Analysis¶
Reports for annotating and filtering genomic variants.
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
Assign variants to genomic bins; useful for burden-test preparation |
||
|
Map variants to overlapping gene loci with distance and region annotations |
— |
|
|
Full annotation expansion for a variant list (consequence, AF, predictions) |
— |
— |
|
Phase 1 — Given a seed variant, returns the seed gene and all partner genes sharing biological context |
||
|
Phase 2 — Collect and filter variants across a gene list with SQL-level pathogenicity filters |
||
|
Variant ↔ gene regulatory evidence (eQTL / sQTL). Accepts gene symbols, rsids, or chr:pos as input; returns one row per (variant × tissue × regulated gene) with effect size and p-value |
Variant Interaction Modeling¶
Direct variant-to-variant interaction modeling from a pre-genotyped input list. Both variants in every pair come from the input — no DB expansion.
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
Input variants → gene overlap → group co-membership → Variant×Variant pairs with group_support_count weight |
SNP×SNP Interaction Pipeline¶
Reports implementing the biologically-informed SNP×SNP interaction workflow. See the full pipeline tutorial and methods document for end-to-end guidance.
Resource |
Link |
|---|---|
Pipeline notebook |
|
Pipeline methods doc |
Report |
Phase |
Description |
Explain |
Notebook |
|---|---|---|---|---|
|
Phase 1 |
Seed variant → partner gene list via biological network |
||
|
Phase 2 |
Gene list → filtered, annotated variant set (Lista A) |
||
|
Phase 2.5 |
Lista A ∩ Lista B → Lista C (genotyped subset, PLINK-ready) |
||
|
Phase 3 |
Lista D → annotated interaction pairs with configurable pairing strategy |
||
|
Legacy |
Earlier SNP×SNP pair model — expands variants from gene loci (superseded by |
Pathway Burden Pipeline¶
Pipeline for prioritising pathways given a list of significant genes (e.g., ExWAS hits) and a target pathway list, using cross-source convergence scoring.
Resource |
Link |
|---|---|
Pipeline notebook |
|
Pipeline methods doc |
Utilities¶
Report |
Description |
Explain |
Notebook |
|---|---|---|---|
|
Blank report template for development and testing |
Coverage summary¶
Status |
Count |
|---|---|
Reports with explain guide + notebook |
20 |
Reports with explain guide only |
2 ( |
Reports with notebook only |
2 ( |
Reports with neither |
1 ( |
Total |
25 |