Functional annotation transfers GO terms from proteins of known function to your query sequences. PROTEA encodes each protein into a high-dimensional embedding, runs a KNN search against a reference annotation set, and aggregates the neighbors' GO terms into a ranked prediction per protein. You need three ingredients before launching a job: precomputed embeddings, a reference annotation set, and the query sequences you want to annotate.
Each step opens its dedicated page so you can build or pick the artefact.
Embedding model?
Reference annotation set?
Query set?