Toolkit/EVidenceModeler

EVidenceModeler

Taxonomy: Technique Branch / Method. Workflows sit above the mechanism and technique branches rather than replacing them.

Summary

Ab initio gene prediction with AUGUSTUS and GeneMark-ES, integrated by EVidenceModeler

Usefulness & Problems

No literature-backed usefulness or problem-fit explainer has been materialized for this record yet.

Published Workflows

Objective: Re-assemble, curate, structurally annotate, functionally annotate, and assess completeness of the argan tree nuclear genome to produce an openly available annotation dataset.

Why it works: The workflow combines re-assembly and curation with multiple gene prediction tools, integrates those predictions, then adds functional annotation and completeness assessment to produce a more comprehensive genome resource.

ab initio gene predictionintegration of multiple gene prediction outputsfunctional annotation by domain and sequence similaritycompleteness assessmentgenome re-assemblyannotation integrationfunctional annotation pipelinebenchmarking with BUSCO

Stages

  1. 1.
    Genome re-assembly and curation(library_build)

    This stage creates the draft genome assembly that all downstream annotation steps depend on.

    Selection: Use previously generated Illumina whole-genome shotgun reads together with the corresponding GenBank assembly to produce a curated draft assembly.

  2. 2.
    Ab initio gene prediction and integration(functional_characterization)

    This stage generates the structural gene annotation needed before functional annotation can be assigned.

    Selection: Predict genes with AUGUSTUS and GeneMark-ES and integrate predictions with EVidenceModeler.

  3. 3.
    Functional annotation(secondary_characterization)

    This stage adds biological interpretation and external evidence support to predicted genes and proteins.

    Selection: Assign curated functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot.

  4. 4.
    Completeness assessment(confirmatory_validation)

    This stage evaluates the completeness of the produced genome resource.

    Selection: Assess assembly gene space and predicted proteome completeness with BUSCO.

Steps

  1. 1.
    Re-assemble and curate the argan nuclear genome draft from existing Illumina reads and the corresponding GenBank assembly

    Generate the draft assembly that serves as the substrate for downstream annotation.

    Structural and functional annotation require a genome assembly first.

  2. 2.
    Run ab initio gene prediction with AUGUSTUS and GeneMark-ESgene prediction components

    Generate candidate gene models from the curated assembly.

    Gene models must be predicted before they can be integrated and functionally annotated.

  3. 3.
    Integrate ab initio predictions with EVidenceModelerprediction integrator

    Combine multiple prediction outputs into a unified structural annotation set.

    Integration follows prediction because EVidenceModeler uses the upstream predictor outputs.

  4. 4.
    Assign functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Protfunctional annotation components

    Add biological interpretation and external evidence support to predicted genes and proteins.

    Functional annotation depends on having predicted genes and proteins from the structural annotation stage.

  5. 5.
    Assess assembly gene space and predicted proteome completeness with BUSCOevaluation component

    Evaluate completeness of the resulting genome resource.

    Completeness assessment is performed after assembly and annotation outputs are available.

Taxonomy & Function

Primary hierarchy

Technique Branch

Method: A concrete method used to build, optimize, or evolve an engineered system.

Techniques

No technique tags yet.

Target processes

No target processes tagged yet.

Validation

Cell-freeBacteriaMammalianMouseHumanTherapeuticIndep. Replication

Supporting Sources

Ranked Claims

Claim 1annotation resultsupports2026Source 1needs review

Ab initio gene prediction with AUGUSTUS and GeneMark-ES integrated by EVidenceModeler produced 51,078 protein-coding genes and 2,081 non-coding RNA genes.

non coding rna genes predicted 2081protein coding genes predicted 51078
Claim 2benchmark resultsupports2026Source 1needs review

BUSCO analyses indicate high completeness of the assembly gene space and predicted proteome completeness of 74.6%.

predicted proteome completeness 74.6 %
Claim 3functional annotation resultsupports2026Source 1needs review

Functional annotation using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot assigned curated functions, domains, and Gene Ontology terms to 32,785 genes and supported 25,484 proteins with UniProt evidence.

genes with curated function domain go annotation 32785proteins with uniprot evidence 25484

Approval Evidence

1 source1 linked approval claimfirst-pass slug evidencemodeler
Ab initio gene prediction with AUGUSTUS and GeneMark-ES, integrated by EVidenceModeler

Source:

annotation resultsupports

Ab initio gene prediction with AUGUSTUS and GeneMark-ES integrated by EVidenceModeler produced 51,078 protein-coding genes and 2,081 non-coding RNA genes.

Source:

Comparisons

No literature-backed comparison notes have been materialized for this record yet.

Ranked Citations

  1. 1.

    Extracted from this source document.