Toolkit/InterProScan

InterProScan

Taxonomy: Technique Branch / Method. Workflows sit above the mechanism and technique branches rather than replacing them.

Summary

Functional annotation combined eggNOG-mapper, InterProScan and BLASTp searches against UniProtKB/Swiss-Prot

Usefulness & Problems

No literature-backed usefulness or problem-fit explainer has been materialized for this record yet.

Published Workflows

Objective: Develop a fungal-specific functional annotation workflow for rapid and accurate functional analyses downstream of RNA-seq without requiring a reference genome.

Why it works: The abstract states that integrating homology searches against fungal-specific databases with expression pattern-based annotations improves functional interpretation and target identification, while the workflow is designed to operate without reference genomes that are often unavailable for non-model fungi.

homology-based functional annotation using fungal-specific databasesexpression pattern-based annotation integrationfunctional enrichment analysisRNA-seq analysisIso-Seq applicabilityhomology searchfunctional enrichment analysis

Stages

  1. 1.
    homology search against fungal-specific databases(broad_screen)

    The abstract explicitly states that the workflow integrates homology searches against fungal-specific databases, addressing limitations of broadly taxonomic tools for fungal transcriptomes.

    Selection: sequence homology against fungal-specific databases

  2. 2.
    expression pattern-based annotation integration(secondary_characterization)

    The abstract states that expression pattern-based annotations were integrated with homology searches to highlight utility for target identification.

    Selection: expression pattern-based annotation signals integrated with homology results

  3. 3.
    functional enrichment analysis(functional_characterization)

    The abstract reports that functional enrichment analyses revealed higher-resolution functional detection than existing annotation tools.

    Selection: functional enrichment of annotated transcript sets

Objective: Re-assemble, curate, structurally annotate, functionally annotate, and assess completeness of the argan tree nuclear genome to produce an openly available annotation dataset.

Why it works: The workflow combines re-assembly and curation with multiple gene prediction tools, integrates those predictions, then adds functional annotation and completeness assessment to produce a more comprehensive genome resource.

ab initio gene predictionintegration of multiple gene prediction outputsfunctional annotation by domain and sequence similaritycompleteness assessmentgenome re-assemblyannotation integrationfunctional annotation pipelinebenchmarking with BUSCO

Stages

  1. 1.
    Genome re-assembly and curation(library_build)

    This stage creates the draft genome assembly that all downstream annotation steps depend on.

    Selection: Use previously generated Illumina whole-genome shotgun reads together with the corresponding GenBank assembly to produce a curated draft assembly.

  2. 2.
    Ab initio gene prediction and integration(functional_characterization)

    This stage generates the structural gene annotation needed before functional annotation can be assigned.

    Selection: Predict genes with AUGUSTUS and GeneMark-ES and integrate predictions with EVidenceModeler.

  3. 3.
    Functional annotation(secondary_characterization)

    This stage adds biological interpretation and external evidence support to predicted genes and proteins.

    Selection: Assign curated functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot.

  4. 4.
    Completeness assessment(confirmatory_validation)

    This stage evaluates the completeness of the produced genome resource.

    Selection: Assess assembly gene space and predicted proteome completeness with BUSCO.

Steps

  1. 1.
    Re-assemble and curate the argan nuclear genome draft from existing Illumina reads and the corresponding GenBank assembly

    Generate the draft assembly that serves as the substrate for downstream annotation.

    Structural and functional annotation require a genome assembly first.

  2. 2.
    Run ab initio gene prediction with AUGUSTUS and GeneMark-ESgene prediction components

    Generate candidate gene models from the curated assembly.

    Gene models must be predicted before they can be integrated and functionally annotated.

  3. 3.
    Integrate ab initio predictions with EVidenceModelerprediction integrator

    Combine multiple prediction outputs into a unified structural annotation set.

    Integration follows prediction because EVidenceModeler uses the upstream predictor outputs.

  4. 4.
    Assign functions, domains, and Gene Ontology terms using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Protfunctional annotation components

    Add biological interpretation and external evidence support to predicted genes and proteins.

    Functional annotation depends on having predicted genes and proteins from the structural annotation stage.

  5. 5.
    Assess assembly gene space and predicted proteome completeness with BUSCOevaluation component

    Evaluate completeness of the resulting genome resource.

    Completeness assessment is performed after assembly and annotation outputs are available.

Taxonomy & Function

Primary hierarchy

Technique Branch

Method: A concrete method used to build, optimize, or evolve an engineered system.

Techniques

No technique tags yet.

Target processes

No target processes tagged yet.

Validation

Cell-freeBacteriaMammalianMouseHumanTherapeuticIndep. Replication

Supporting Sources

Ranked Claims

Claim 1annotation resultsupports2026Source 1needs review

Ab initio gene prediction with AUGUSTUS and GeneMark-ES integrated by EVidenceModeler produced 51,078 protein-coding genes and 2,081 non-coding RNA genes.

non coding rna genes predicted 2081protein coding genes predicted 51078
Claim 2benchmark resultsupports2026Source 1needs review

BUSCO analyses indicate high completeness of the assembly gene space and predicted proteome completeness of 74.6%.

predicted proteome completeness 74.6 %
Claim 3functional annotation resultsupports2026Source 1needs review

Functional annotation using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot assigned curated functions, domains, and Gene Ontology terms to 32,785 genes and supported 25,484 proteins with UniProt evidence.

genes with curated function domain go annotation 32785proteins with uniprot evidence 25484

Approval Evidence

2 sources1 linked approval claimfirst-pass slug interproscan
Functional annotation combined eggNOG-mapper, InterProScan and BLASTp searches against UniProtKB/Swiss-Prot

Source:

The web research summary states that InterProScan is an explicit annotation component in the anchor workflow for protein-domain/function assignment.

Source:

functional annotation resultsupports

Functional annotation using eggNOG-mapper, InterProScan, and BLASTp against UniProtKB/Swiss-Prot assigned curated functions, domains, and Gene Ontology terms to 32,785 genes and supported 25,484 proteins with UniProt evidence.

Source:

Comparisons

No literature-backed comparison notes have been materialized for this record yet.

Ranked Citations

  1. 1.

    Extracted from this source document.