InteracTor: A new integrative feature extraction toolkit for improved characterization of protein structural properties

Jose Cleydson F. Silva
Layla Schuster
Nick Sexson
Matias Kirst
Marcio F. R. Resende
Raquel Dias

This article has been reviewed by the following groups

Listed in: Evaluated articles (Arcadia Science)

Abstract

Understanding the structural and functional diversity of protein families is crucial for elucidating their biological roles. Traditional analyses often focus on primary and secondary structures, which include amino acid sequences and local folding patterns like alpha helices and beta sheets. However, primary and secondary structures alone may not fully represent the complex interactions within proteins. To address this limitation, we developed a new algorithm (InteracTor) to analyze proteins by extracting features from their three-dimensional (3D) structures. The toolkit extracts interatomic interaction features such as hydrogen bonds, van der Waals interactions, and hydrophobic contacts, which are crucial for understanding protein dynamics, structure, and function. Incorporating 3D structural data and interatomic interaction features provides a more comprehensive understanding of protein structure and function, potentially enhancing downstream predictive modeling capabilities. By using the extracted features in Mutual Information scoring (MI), Principal Component Analysis (PCA), t-distributed Stochastic Neighbor Embedding (t-SNE), Uniform Manifold Approximation and Projection (UMAP), and hierarchical clustering analysis as use cases, we identified clear separations among protein structural families, highlighting distinct functional aspects. Our analysis revealed that interatomic interaction features were more informative than protein secondary structure features, providing insights into potential structural and functional properties. These findings underscore the significance of considering tertiary structure in protein analysis, offering a robust framework for future studies aiming at enhancing the capabilities of models for protein function prediction and drug discovery.

Arcadia Science
Oct 25, 2024

Among the 12 most highly ranked features across protein families are hydrogen bonds (MI=0.775), total surface tension (MI=0.763), London dispersion forces (MI=0.758), repulsive interactions (MI=0.722), internal tension (MI=0.708), ASA (MI=0.694), hydrophobic contacts (MI=0.561), TG frequency (MI=0.562), internal hydrophobicity (MI=0.561), VN frequency (MI=0.556), total hydrophobicity (MI=0.539), and GG frequency (MI=0.509).
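For readers unfamiliar with mutual information (MI) ranking, the idea behind a list like the one quoted above can be sketched in a few lines of Python. This is a minimal illustration, not the preprint's implementation: the equal-width binning, the function names (`equal_width_bins`, `mutual_information`, `rank_features`), and the use of natural logarithms are all assumptions made here, and the estimator the authors used may differ.

```python
import math
from collections import Counter


def equal_width_bins(values, n_bins=4):
    """Discretize a numeric feature into n_bins equal-width bins (an assumed scheme)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature carries no information; put everything in one bin.
        return [0] * len(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def mutual_information(xs, ys):
    """Plug-in MI estimate (in nats) between two discrete sequences of equal length."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts.
        mi += (count / n) * math.log(count * n / (px[x] * py[y]))
    return mi


def rank_features(features, labels, n_bins=4):
    """Return (feature name, MI with the family label) pairs, highest MI first."""
    scores = {
        name: mutual_information(equal_width_bins(values, n_bins), labels)
        for name, values in features.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Calling `rank_features` with a dictionary that maps feature names to per-protein values, plus a list of family labels, returns the features ordered by how much they reveal about family membership. A feature that is constant across families scores 0.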

This is really interesting! I think it could also be interesting to see if any of the features (these or others) correlate or if any features could be predictive of others?
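One simple way to look into that question is a pairwise Pearson correlation screen over the extracted features. The sketch below is only an illustration of the idea: the function names and the 0.9 threshold are arbitrary choices made here, and nothing in it comes from the preprint.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        # Correlation is undefined for a constant feature.
        return float("nan")
    return cov / (sd_x * sd_y)


def correlated_pairs(features, threshold=0.9):
    """List (name_a, name_b, r) for feature pairs with |r| at or above the threshold."""
    names = sorted(features)
    pairs = []
    for i, name_a in enumerate(names):
        for name_b in names[i + 1:]:
            r = pearson(features[name_a], features[name_b])
            if abs(r) >= threshold:  # NaN never passes this comparison
                pairs.append((name_a, name_b, r))
    return pairs
```

Pearson correlation only captures linear relationships. A rank-based measure, or MI computed between pairs of features, would be needed to detect non-linear dependence.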

Arcadia Science
Oct 25, 2024

Here we present InteracTor, a new toolkit for the extraction of three types of protein feature encodings: interaction features, physicochemical features, and compositional features.

This is super cool! I can't wait to try it out!

Arcadia Science
Oct 25, 2024

Extract atom, residue, and sequence information from PDB file (Figure 1A): This step involves parsing the Protein Data Bank (PDB) file to obtain the atomic types, 3D coordinates, and the amino acid sequence of the protein.
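To make this parsing step concrete, here is a minimal sketch that reads the fixed-width ATOM records of the legacy PDB format. It is not InteracTor's own parser. The function names (`parse_pdb_atoms`, `sequence_by_chain`) and the decisions to keep only the first model and the first alternate location are assumptions made for illustration.

```python
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def parse_pdb_atoms(lines):
    """Collect atom name, element, residue, chain, and coordinates from ATOM records."""
    atoms = []
    for line in lines:
        if line.startswith("ENDMDL"):
            break  # assumption: only the first model of a multi-model file is used
        if not line.startswith("ATOM") or len(line) < 54:
            continue
        if line[16] not in (" ", "A"):
            continue  # assumption: keep only the first alternate location
        atoms.append({
            "name": line[12:16].strip(),       # columns 13-16
            "resname": line[17:20].strip(),    # columns 18-20
            "chain": line[21],                 # column 22
            "resseq": int(line[22:26]),        # columns 23-26
            "icode": line[26].strip(),         # column 27
            "coord": (float(line[30:38]),      # x, columns 31-38
                      float(line[38:46]),      # y, columns 39-46
                      float(line[46:54])),     # z, columns 47-54
            "element": line[76:78].strip(),    # columns 77-78, may be blank
        })
    return atoms


def sequence_by_chain(atoms):
    """Build a one-letter sequence per chain from the residues seen, in file order."""
    sequences = {}
    seen = set()
    for atom in atoms:
        key = (atom["chain"], atom["resseq"], atom["icode"])
        if key in seen:
            continue
        seen.add(key)
        letter = THREE_TO_ONE.get(atom["resname"], "X")  # X for non-standard residues
        sequences[atom["chain"]] = sequences.get(atom["chain"], "") + letter
    return sequences
```

Typical use would be `atoms = parse_pdb_atoms(open(path))` with a hypothetical `path`. Note that a sequence built from ATOM records omits residues that are unresolved in the structure, so it can differ from the SEQRES sequence.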

I'm curious if you can use this with structures predicted by AlphaFold or ESMFold. Related to that, I'm curious if you need to do any sort of pre-processing of the structures (mostly for AlphaFold and ESMFold structures because they're known to not always have optimal side chain placement).
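One pre-processing check a reader might try with predicted structures is to flag low-confidence residues before extracting features. AlphaFold model files store the per-residue pLDDT confidence score in the B-factor column of each ATOM record. The sketch below assumes that convention and a 0 to 100 scale. The function names and the cutoff of 70 are illustrative choices, and none of this is described in the preprint. Other predictors may use a different scale, so the cutoff should be checked against the tool that produced the file.

```python
def residue_plddt(pdb_lines):
    """Map (chain, residue number) to the B-factor value of that residue's CA atom.

    For AlphaFold models this value is the pLDDT score (assumed 0-100 scale).
    """
    scores = {}
    for line in pdb_lines:
        if not line.startswith("ATOM") or len(line) < 66:
            continue
        if line[12:16].strip() != "CA":
            continue
        key = (line[21], int(line[22:26]))   # chain ID, residue sequence number
        scores[key] = float(line[60:66])     # B-factor field, columns 61-66
    return scores


def low_confidence_residues(pdb_lines, cutoff=70.0):
    """Return sorted (chain, residue number) keys whose score falls below the cutoff."""
    return sorted(key for key, score in residue_plddt(pdb_lines).items() if score < cutoff)
```

Residues flagged this way could be excluded or inspected before computing interaction features. Whether that improves the extracted features is an open question that this sketch does not answer.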

Arcadia Science
Oct 25, 2024

A)

I think this figure might also be mixed up.

Arcadia Science
Oct 25, 2024

A)

I think I only see one panel in this figure

Version published to 10.1101/2024.10.07.616705v1 on bioRxiv
Oct 11, 2024

Related articles

Combining structural modeling and deep learning to calculate the E. coli protein interactome and functional networks

This article has 8 authors:
1. H. Zhao
2. C. Velez
3. A. Navarene
4. A. Saha
5. J. Feldman
6. J. Skolnick
7. D. Murray
8. B. Honig
This article has no evaluations
Latest version May 12, 2025

Unraveling cooperative and competitive interactions within protein triplets in the human interactome

This article has 3 authors:
1. Aimilia-Christina Vagiona
2. Pablo Mier
3. Miguel A. Andrade-Navarro
This article has no evaluations
Latest version Jun 15, 2025

Exploring Protein Patterns, Cavity Interactions, and Therapeutic Insights in Cancer

This article has 3 authors:
1. Paloma Tejera-Nevado
2. Belén Otero-Carrasco
3. Alejandro Rodríguez-González
This article has no evaluations
Latest version Jun 6, 2025
