Toolkit/BUSCO
BUSCO
Taxonomy: Technique Branch / Method. Workflows sit above the mechanism and technique branches rather than replacing them.
Summary
BUSCO analyses indicate high completeness of the assembly gene space and completeness of the predicted proteome (74.6%)
Usefulness & Problems
Why this is useful
BUSCO is used here to assess completeness of the assembly gene space and predicted proteome.; completeness assessment of genome assemblies and predicted proteomes
Source:
BUSCO is used here to assess completeness of the assembly gene space and predicted proteome.
Source:
completeness assessment of genome assemblies and predicted proteomes
Problem solved
It provides a standardized completeness check for the annotation resource.; evaluating completeness of assembly gene space and proteome predictions
Source:
It provides a standardized completeness check for the annotation resource.
Source:
evaluating completeness of assembly gene space and proteome predictions
Problem links
evaluating completeness of assembly gene space and proteome predictions
LiteratureIt provides a standardized completeness check for the annotation resource.
Source:
It provides a standardized completeness check for the annotation resource.
Published Workflows
Objective: Re-assemble, curate, structurally annotate, functionally annotate, and assess completeness of the argan tree nuclear genome to produce an openly available annotation dataset.
Why it works: The workflow combines re-assembly and curation with multiple gene prediction tools, integrates those predictions, then adds functional annotation and completeness assessment to produce a more comprehensive genome resource.
Stages
- 1.Genome re-assembly and curation(library_build)
This stage creates the draft genome assembly that all downstream annotation steps depend on.
Selection: Use previously generated Illumina whole-genome shotgun reads together with the corresponding GenBank assembly to produce a curated draft assembly.
- 2.Ab initio gene prediction and integration(functional_characterization)
This stage generates the structural gene annotation needed before functional annotation can be assigned.
Selection: Predict genes with AUGUSTUS and GeneMark-ES and integrate predictions with EVidenceModeler.
- 3.Functional annotation(secondary_characterization)
This stage adds biological interpretation and external evidence support to predicted genes and proteins.
Selection: Assign curated functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot.
- 4.Completeness assessment(confirmatory_validation)
This stage evaluates the completeness of the produced genome resource.
Selection: Assess assembly gene space and predicted proteome completeness with BUSCO.
Steps
- 1.Re-assemble and curate the argan nuclear genome draft from existing Illumina reads and the corresponding GenBank assembly
Generate the draft assembly that serves as the substrate for downstream annotation.
Structural and functional annotation require a genome assembly first.
- 2.Run ab initio gene prediction with AUGUSTUS and GeneMark-ESgene prediction components
Generate candidate gene models from the curated assembly.
Gene models must be predicted before they can be integrated and functionally annotated.
- 3.Integrate ab initio predictions with EVidenceModelerprediction integrator
Combine multiple prediction outputs into a unified structural annotation set.
Integration follows prediction because EVidenceModeler uses the upstream predictor outputs.
- 4.Assign functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Protfunctional annotation components
Add biological interpretation and external evidence support to predicted genes and proteins.
Functional annotation depends on having predicted genes and proteins from the structural annotation stage.
- 5.Assess assembly gene space and predicted proteome completeness with BUSCOevaluation component
Evaluate completeness of the resulting genome resource.
Completeness assessment is performed after assembly and annotation outputs are available.
Taxonomy & Function
Primary hierarchy
Technique Branch
Method: A concrete measurement method used to characterize an engineered system.
Mechanisms
comparative completeness benchmarkingTechniques
Functional AssayTarget processes
No target processes tagged yet.
Implementation Constraints
The abstract supports that it is applied to the assembled genome and predicted proteome outputs.; requires an assembly or predicted proteome to assess
The abstract does not describe it as improving the assembly or generating annotations directly.; the abstract reports a proteome completeness value but does not provide deeper benchmark detail
Validation
Supporting Sources
Ranked Claims
Ab initio gene prediction with AUGUSTUS and GeneMark-ES integrated by EVidenceModeler produced 51,078 protein-coding genes and 2,081 non-coding RNA genes.
BUSCO analyses indicate high completeness of the assembly gene space and predicted proteome completeness of 74.6%.
Functional annotation using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot assigned curated functions, domains, and Gene Ontology terms to 32,785 genes and supported 25,484 proteins with UniProt evidence.
Approval Evidence
BUSCO analyses indicate high completeness of the assembly gene space and completeness of the predicted proteome (74.6%)
Source:
BUSCO analyses indicate high completeness of the assembly gene space and predicted proteome completeness of 74.6%.
Source:
Comparisons
Source-backed strengths
explicitly used as the reported completeness assessment method
Source:
explicitly used as the reported completeness assessment method
Compared with Langendorff perfused heart electrical recordings
BUSCO and Langendorff perfused heart electrical recordings address a similar problem space.
Shared frame: same top-level item type
Strengths here: looks easier to implement in practice.
Compared with native green gel system
BUSCO and native green gel system address a similar problem space.
Shared frame: same top-level item type
Strengths here: looks easier to implement in practice.
BUSCO and sub-picosecond pump-probe analysis of bacteriorhodopsin pigments address a similar problem space.
Shared frame: same top-level item type
Strengths here: looks easier to implement in practice.
Ranked Citations
- 1.