Hostname: page-component-cb9f654ff-pvkqz Total loading time: 0 Render date: 2025-08-13T09:45:49.315Z Has data issue: false hasContentIssue false

A structural and functional bioinformatics study of QTY-designed retinylidene proteins

Published online by Cambridge University Press:  14 July 2025

Siqi Pan*
Affiliation:
Independent Researcher
*
Corresponding author: Siqi Pan; Email: siqipan2008@outlook.com
Rights & Permissions [Opens in a new window]

Abstract

Retinylidene proteins are retinal-binding light-sensitive proteins found in organisms ranging from microbes to human. Microbial opsins have been utilized in optogenetics, while animal opsins are essential for vision and light-dependent metabolic functions. However, retinylidene proteins have hydrophobic transmembrane (TM) domains, which makes them challenging to study. In this structural and functional bioinformatics study, I use the QTY (glutamine, threonine, tyrosine) code to design water-soluble QTY analogues of retinylidene proteins, including nine human and three microbial opsins. I provide superpositions of the AlphaFold3-predicted hydrophobic native proteins and their water-soluble QTY analogues, and experimentally determined structures when available. I also provide a comparison of surface hydrophobicity of the variants. Despite significant changes to the protein sequence (35.53–50.24% in the TM domain), protein characteristics and structures are well preserved. Furthermore, I run molecular dynamics (MD) simulations of native and QTY-designed OPN2 (rhodopsin) and analyze their response to the isomerization of 11-cis-retinal to all-trans-retinal. The results show that the QTY analogue has similar functional behavior to the native protein. The findings of this study indicate that the QTY code can be used as a robust tool to design water-soluble retinylidene proteins. These have potential applications in protein studies, therapeutic treatments, and bioengineering.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Introduction

Retinylidene proteins are photochemically reactive proteins that are bound to or can bind to retinal (vitamin A aldehyde) as their chromophore (Spudich et al., Reference Spudich, Yang, Jung and Spudich2000). For convenience, I will use the term ‘opsin’ interchangeably with ‘retinylidene protein’, despite the fact that ‘opsin’ sometimes refers specifically to the chromophore-free apoprotein. Retinylidene proteins are divided into two groups, animal opsins and microbial opsins, which are evolutionarily distinct but share common characteristics such as having seven transmembrane (TM) domains and a retinal-binding lysine residue (Spudich et al., Reference Spudich, Yang, Jung and Spudich2000; Yee et al., Reference Yee, Shlykov, Vastermark, Reddy, Arora, Sun and Saier2013). In this study, I will consider nine animal opsins and three microbial opsins.

Animal opsins are a type of class A GPCR (G-protein-coupled receptor), which are characterized by seven transmembrane domains, the NPxxY motif, and activation of G proteins through the outward movement of TM6 (Zhou et al., Reference Zhou, Yang, Wu, Guo, Guo, Zhong, Cai, Dai, Jang, Shakhnovich, Liu, Stevens, Lambert, Babu, Wang and Zhao2019). Almost all animal opsins include a lysine residue that forms a Schiff base link with the retinal (Gühmann et al., Reference Gühmann, Porter and Bok2022). When retinal absorbs a photon, it isomerizes, usually changing from 11-cis to all-trans. The subsequent conformational changes of the protein are well studied using bovine rhodopsin, which changes from dark state to BATHO state, then LUMI, META I, and META II (Okada et al., Reference Okada, Ernst, Palczewski and Hofmann2001). Proton transfer plays an important role in this process (Mahalingam et al., Reference Mahalingam, Martínez-Mayorga, Brown and Vogel2008). Afterward, the protein bleaches and returns to the dark state, completing what is called the photocycle. As the protein conformation changes to META, TM6 moves characteristically outward, activating the G protein. In photoreceptor cells, the G protein, transducin, activates a phosphodiesterase, which hydrolyzes cGMP into GMP, decreasing the activity of cGMP-gated cation channels and hyperpolarizing the cell (Chabre and Deterre, Reference Chabre and Deterre1989).

In this study, I select nine opsins expressed in the human nervous system: OPN1MW (UniProt ID: P04001), OPN1LW (UniProt ID: P04001), OPN1SW (UniProt ID: P03999), OPN2 (UniProt ID: P04001), OPN3 (UniProt ID: Q9H1Y3), OPN4 (UniProt ID: Q9UHM6), OPN5 (UniProt ID: Q6U736), RGR (UniProt ID: P47804), and RRH (UniProt ID: O14718). They belong to several evolutionarily distinct families of animal opsins (Terakita, Reference Terakita2005).

OPN1MW (medium-wave-sensitive opsin 1), OPN1LW (long-wave-sensitive opsin 1), and OPN1SW (short-wave-sensitive opsin 1) are expressed in retinal cone photoreceptors and are responsible for color vision (Bowmaker and Dartnall, Reference Bowmaker and Dartnall1980). Certain variants of OPN1MW, OPN1LW, and OPN1SW, respectively, cause deuteranopia, protanopia, and tritanopia, which are different types of color blindness (Baraas et al., Reference Baraas, Hagen, Dees and Neitz2012; Ueyama et al., Reference Ueyama, Kuwayama, Imai, Tanabe, Oda, Nishida, Wada, Shichida and Yamade2002). The absence of both functional OPN1MW and OPN1LW causes blue cone monochromacy, an X-linked congenital cone dysfunction syndrome (Wissinger et al., Reference Wissinger, Baumann, Buena-Atienza, Ravesh, Cideciyan, Stingl, Audo, Meunier, Bocquet, Traboulsi, Hardcastle, Gardner, Michaelides, Branham, Rosenberg, Andreasson, Dollfus, Birch, Vincent, Martorell, Català Mora, Kellner, Rüther, Lorenz, Preising, Manfredini, Zarate, Vijzelaar, Zrenner, Jacobson and Kohl2022), and cone dystrophy 5, an X-linked cone dystrophy (Gardner et al., Reference Gardner, Webb, Kanuga, Robson, Holder, Stockman, Ripamonti, Ebenezer, Ogun, Devery, Wright, Maher, Cheetham, Moore, Michaelides and Hardcastle2010).

OPN2 (opsin 2), also known as rhodopsin, is expressed in retinal rod photoreceptors and is responsible for vision at low light intensity (Hubbard and Kropf, Reference Hubbard and Kropf1958). Certain variants lead to autosomal recessive or autosomal dominant retinitis pigmentosa and congenital stationary night blindness (Fanelli et al., Reference Fanelli, Felline and Marigo2021). OPN2 is a representative animal opsin, one of the first studied. In fact, bovine OPN2 is the first opsin to be sequenced (Nathans and Hogness, Reference Nathans and Hogness1984), as well as the first GPCR whose crystal structure was resolved experimentally (Palczeski et al., Reference Palczeski, Behnke, Motoshima and Fox2000). Many studies on the functional mechanisms of animal opsins also focus on OPN2. Consequently, I chose OPN2 to conduct a functional analysis in this study in order to further explore the effectiveness of the QTY code in redesigning retinylidene proteins.

OPN3 (opsin 3), also known as encephalopsin or panopsin, is activated by blue and ultraviolet A light. It was discovered in the brain (Blackshaw and Snyder, Reference Blackshaw and Snyder1999). It is also expressed in melanocytes and keratinocytes in the skin and regulates functions such as melanogenesis, cell differentiation, and glucose uptake. Its expression is also found in the liver, pancreas, kidney, lung, heart, and skeletal muscles. When expressed in neurons, the release of neurotransmitters is inhibited by light, making OPN3 a useful inhibitory optogenetic tool (Copits et al., Reference Copits, Gowrishankar, O’Neill, Li, Girven, Yoo, Meshik, Parker, Spangler, Elerding, Brown, Shirley, Ma, Vasquez, Stander, Kalyanaraman, Vogt, Samineni, Patriarchi, Tian, Gautam, Sunahara, Gereau and Bruchas2021).

OPN4 (opsin 4), also known as melanopsin, is expressed in ipRGC (intrinsically photosensitive retinal ganglion cells) in the ganglion cell layer in the retina (Provencio et al., Reference Provencio, Jiang, De Grip, Hayes and Rollag1998). It is essential for non-image-forming responses to light, including the pupillary reflex, optokinetic visual tracking response, and photoentrainment and regulation of circadian rhythm. Certain variants of OPN4 lead to seasonal affective disorder and other circadian rhythm disorders (Berson et al., Reference Berson, Dunn and Takao2002). Rendering OPN4-containing ipRGCs capable of image formation is also a potential pathway for the treatment of various eye diseases such as retinitis pigmentosa and diabetic retinopathy.

OPN5 (opsin 5), also known as neuropsin, is activated by blue and ultraviolet A light (Tarttelin et al., Reference Tarttelin, Bellingham, Hankins, Foster and Lucas2003). It is expressed in the retina and contributes to the regulation of light-dependent vascular development and photoentrainment in the cornea and retina (Buhr et al., Reference Buhr, Yue, Ren, Jiang, Liao, Mei, Vemaraju, Nguyen, Reed, Lang, Yau and Van Gelder2015). Certain variants of OPN5 may lead to cycloplegia, paralysis of the ciliary muscle in the eye.

RGR (RPE-retinal GPCR) is expressed in RPE (retinal pigmented epithelium) and Müller cells in the retina (Shen et al., Reference Shen, Jiang, Hao, Tao, Salazar and Fong1994). Unlike the aforementioned human opsins, RGR preferentially binds all-trans-retinal and may catalyze its isomerization into 11-cis-retinal via a retinochrome-like mechanism. It is expressed only in tissue surrounding photoreceptors and plays a role in the light-dependent synthesis of visual chromophore (Radu et al., Reference Radu, Hu, Peng, Bok, Mata and Travis2008).

RRH (RPE-derived rhodopsin homolog), also known as peropsin, is localized in the microvilli of RPE cells that surround photoreceptor outer segments (Sun et al., Reference Sun, Gilbert, Copeland, Jenkins and Nathans1997). It is another protein that preferentially binds to all-trans-retinal (Cook et al., Reference Cook, Ng, Lloyd, Eddington, Sun, Nathans, Bok, Radu and Travis2017). Although not much information is known about RRH, it can reasonably be inferred that it photoisomerizes all-trans-retinal to 11-cis-retinal and may play a role in the upkeep of photoreceptor functions.

Microbial opsins are transmembrane ion pumps or channels (Findlay and Pappin, Reference Findlay and Pappin1986). They are also 7TM, though this is due to convergent evolution rather than homology (Yee et al., Reference Yee, Shlykov, Vastermark, Reddy, Arora, Sun and Saier2013). The chromophore, retinal, usually isomerizes from all-trans to 13-cis, different from the case of animal opsins (Findlay and Pappin, Reference Findlay and Pappin1986). In addition, microbial opsins often form oligomers to carry out their functions (Gmelin et al., Reference Gmelin, Zeth, Efremov, Heberle, Tittor and Oesterhelt2007).

In this study, I select three microbial opsins: BACR (UniProt ID: P02945), BACH (UniProt ID: B0R2U4), and ChR2 (UniProt ID: Q8RUT8).

BACR (bacteriorhodopsin) is a light-driven proton pump. It is one of the first microbial opsins discovered (Oesterhelt and Stoeckenius, Reference Oesterhelt and Stoeckenius1971). BACH (halorhodopsin) is a light-driven chloride pump activated by yellow light (Schobert and Lanyi, Reference Schobert and Lanyi1982). ChR2 (channelrhodopsin 2) is a light-activated sodium channel activated by blue light (Nagel et al., Reference Nagel, Szellas, Huhn, Kateriya, Adeishvili, Berthold, Ollig, Hegemann and Bamberg2003). BACH and ChR2 are among the first optogenetic tools (Han and Boyden, Reference Han and Boyden2007; Zhang et al., Reference Zhang, Wang, Brauner, Liewald, Kay, Watzke, Wood, Bamberg, Nagel, Gottschalk and Deisseroth2007). BACH is used for inhibition, while ChR2 is used for excitation.

I have asked whether retinylidene proteins could be redesigned to be more soluble. Retinylidene proteins are all integral membrane proteins with seven transmembrane alpha helices embedded in a lipid bilayer. Because of the hydrophobic properties of the transmembrane domains, they are not water-soluble without the aid of detergents.

Bacteriorhodopsin has been the subject of several protein-solubilizing studies, though with limited success (Qing et al., Reference Qing, Hao, Smorodina, Jin, Zalevsky and Zhang2022). Recently, researchers leveraged a neural network, SolubleMPNN, which was built upon ProteinMPNN, to engineer soluble variants of bacteriorhodopsin while maintaining its ligand-binding ability and light-sensing function (Nikolaev et al., Reference Nikolaev, Orlov, Tsybrov, Kuznetsova, Shishkin, Kuzmin, Mikhailov, Nikolaeva, Anuchina, Chizhov, Semenov, Kapranov, Borshchevskiy, Remeeva and Gushchin2025).

Instead of taking a computational approach, I apply the QTY (glutamine, threonine, tyrosine) code to systematically engineer water-soluble analogues with reduced hydrophobicity in membrane proteins. There are structural similarities between hydrophobic and polar amino acids: leucine (L) and glutamine (Q); isoleucine (I)/valine (V) and threonine (T); and phenylalanine (F) and tyrosine (Y), as can be observed on high-resolution electron density maps (Tegler et al., Reference Tegler, Corin, Pick, Brookes, Skuhersky, Vogel and Zhang2020; Zhang and Egli, Reference Zhang and Egli2022; Zhang et al., Reference Zhang, Tao, Qing, Tang, Skuhersky, Corin, Tegler, Wassie, Wassie, Kwon, Suter, Entzian, Schubert, Yang, Labahn, Kubicek and Maertens2018). This justifies the replacement of hydrophobic amino acids with polar ones. Zhang et al. initially applied the QTY code to design several detergent-free chemokine receptors, all of which retained structural thermal stability and native ligand-binding and enzymatic activities despite substantial changes to the transmembrane domain (Tegler et al., Reference Tegler, Corin, Pick, Brookes, Skuhersky, Vogel and Zhang2020; Zhang et al., Reference Zhang, Tao, Qing, Tang, Skuhersky, Corin, Tegler, Wassie, Wassie, Kwon, Suter, Entzian, Schubert, Yang, Labahn, Kubicek and Maertens2018). Zhang’s team also applied the QTY code to design water-soluble GPCRs, including chemokine receptors (Qing et al., Reference Qing, Han, Skuhersky, Chung, Badr, Schubert and Zhang2019; Skuhersky et al., Reference Skuhersky, Tao, Qing, Smorodina, Jin and Zhang2021; Tegler et al., Reference Tegler, Corin, Pick, Brookes, Skuhersky, Vogel and Zhang2020; Zhang et al., Reference Zhang, Tao, Qing, Tang, Skuhersky, Corin, Tegler, Wassie, Wassie, Kwon, Suter, Entzian, Schubert, Yang, Labahn, Kubicek and Maertens2018), cytokine receptors (Hao et al., Reference Hao, Jin, Zhang and Qing2020), and olfactory receptors (Johnsson et al., Reference Johnsson, Karagöl, Karagöl and Zhang2025; Skuhersky et al., Reference Skuhersky, Tao, Qing, Smorodina, Jin and Zhang2021). In addition to GPCR, they used the QTY code to design water-soluble glucose transporters (Smorodina et al., Reference Smorodina, Tao, Qing, Jin, Yang and Zhang2022), ABC transporters (Pan et al., Reference Pan, Tao, Smorodina and Zhang2024), monoamine neurotransmitter transporters (Karagöl et al., Reference Karagöl, Karagöl and Zhang2024c), glutamate transporters (Karagöl et al., Reference Karagöl, Karagöl, Smorodina and Zhang2024a, Reference Karagöl, Karagöl and Zhang2024b), mitochondrial megacomplex (Chen and Zhang, Reference Chen and Zhang2025), antibodies (Li et al., Reference Li, Wang, Tao, Xu and Zhang2024b), potassium channels (Smorodina et al., Reference Smorodina, Tao, Qing, Yang and Zhang2024), receptor kinases (Li et al., Reference Li, Tang, Qing, Wang, Liu, Wang, Lyu, Ma, Xu, Zhang and Tao2024a), transmembrane enzymes (Chen et al., Reference Chen, Pan and Zhang2025), as well as bacterial enzymes with beta barrel structures (Sajeev-Sheeja and Zhang, Reference Sajeev-Sheeja and Zhang2024; Sajeev-Sheeja et al., Reference Sajeev-Sheeja, Smorodina and Zhang2023). The QTY-designed water-soluble CXCR4 chemokine receptor was successfully used in biomimetic sensors (Qing et al., Reference Qing, Xue, Zhao, Wu, Breitwieser, Smorodina, Schubert, Azzellino, Jin, Kong, Palacios, Sleytr and Zhang2023).

AlphaFold2 was released by Google DeepMind in July 2021 (Jumper et al., Reference Jumper, Evans, Pritzel, Green, Figurnov, Ronneberger, Tunyasuvunakool, Bates, Žídek, Potapenko, Bridgland, Meyer, Kohl, Ballard, Cowie, Romera-Paredes, Nikolov, Jain, Adler, Back, Petersen, Reiman, Clancy, Zielinski, Steinegger, Pacholska, Berghammer, Bodenstein, Silver, Vinyals, Senior, Kavukcuoglu, Kohli and Hassabis2021). It has greatly facilitated the study of QTY-designed protein variants. Subsequently, AlphaFold3 was released in May 2024, featuring improved architecture and improved efficiency (Abramson et al., Reference Abramson, Adler, Dunger, Evans, Green, Pritzel, Ronneberger, Willmore, Ballard, Bambrick, Bodenstein, Evans, Hung, O’Neill, Reiman, Tunyasuvunakool, Wu, Žemgulytė, Arvaniti, Beattie, Bertolli, Bridgland, Cherepanov, Congreve, Cowen-Rivers, Cowie, Figurnov, Fuchs, Gladman, Jain, Khan, Low, Perlin, Potapenko, Savy, Singh, Stecula, Thillaisundaram, Tong, Yakneen, Zhong, Zielinski, Žídek, Bapst, Kohli, Jaderberg, Hassabis and Jumper2024). Furthermore, AlphaFold3 enabled the accurate prediction of complexes of multiple proteins, as well as complexes with modified residues, nucleic acids, ions, and certain ligands. QTY studies have made use of these new features (Chen and Zhang, Reference Chen and Zhang2025; Johnsson et al., Reference Johnsson, Karagöl, Karagöl and Zhang2025).

GROMACS is a molecular dynamics (MD) simulation program released in 1995 (Berendsen et al., Reference Berendsen, Van Der Spoel and Van Drunen1995). Its 5.0 version was released in 2015 (Abraham et al., Reference Abraham, Murtola, Schulz, Páll, Smith, Hess and Lindahl2015). GROMACS enables an efficient and realistic simulation of biomolecular systems and has been used to investigate the structural and functional properties of QTY-designed protein variants in various studies (Johnsson et al., Reference Johnsson, Karagöl, Karagöl and Zhang2025; Karagöl et al., Reference Karagöl, Karagöl and Zhang2024b; Li et al., Reference Li, Tang, Qing, Wang, Liu, Wang, Lyu, Ma, Xu, Zhang and Tao2024a, Reference Li, Wang, Tao, Xu and Zhang2024b; Smorodina et al., Reference Smorodina, Tao, Qing, Yang and Zhang2024).

In this article, I apply the QTY code to redesign nine human opsins and three microbial opsins. I provide the superpositions of the AlphaFold3-predicted hydrophobic native proteins and their water-soluble QTY variants, and experimentally determined structures when available. I also provide a comparison of the surface hydrophobicity of the variants. Furthermore, I run MD simulations of native and QTY-designed OPN2 and analyze their response to the 11-cis to all-trans isomerization of the retinal chromophore.

Methods

QTY design, protein sequence alignment, and other characteristics

The native protein sequences for OPN1MW, OPN1LW, OPN1SW, OPN2, OPN3, OPN4, OPN5, RGR, RRH, BACR, BACH, and ChR2 were obtained from UniProt (https://www.uniprot.org). The sequence for ChR2 was truncated according to the experimentally determined structure in the RCSB PDB (PDB ID: 8ZAN). The QTY designs were performed through the protein solubilizing server (https://pss.sjtu.edu.cn/) (Tao et al., Reference Tao, Tang, Zhang, Li and Xu2022). For ChR2, the secondary structure was provided to the server in SS3 format according to the RCSB PDB structure.

AlphaFold3 predictions

I predicted the structures of native proteins and their QTY variants using the AlphaFold3 website (https://alphafoldserver.com) (Abramson et al., Reference Abramson, Adler, Dunger, Evans, Green, Pritzel, Ronneberger, Willmore, Ballard, Bambrick, Bodenstein, Evans, Hung, O’Neill, Reiman, Tunyasuvunakool, Wu, Žemgulytė, Arvaniti, Beattie, Bertolli, Bridgland, Cherepanov, Congreve, Cowen-Rivers, Cowie, Figurnov, Fuchs, Gladman, Jain, Khan, Low, Perlin, Potapenko, Savy, Singh, Stecula, Thillaisundaram, Tong, Yakneen, Zhong, Zielinski, Žídek, Bapst, Kohli, Jaderberg, Hassabis and Jumper2024). For microbial opsins, the dimers and trimers have identical subunits, that is, are homodimers and homotrimers. In AlphaFold3, this was achieved by altering the number of protein copies.

Structure superpositions

PDB files for native protein structures determined experimentally by X-ray diffraction or electron microscopy were taken from RCSB PDB: OPN2 (PDB ID: 5W0P), BACR (PDB ID: 7XJC), BACH (PDB ID: 2JAF), ChR2 (PDB ID: 8ZAN). The AlphaFold3-predicted native and QTY variants were taken directly from the most probable predicted structure. Superpositions were performed via the command ‘super’ in PyMOL (https://pymol.org). I removed unstructured loops at the N and C terminals for the sake of clarity.

Structure visualization

I used PyMOL (https://pymol.org) to superpose the native predicted protein structures, their QTY variants, and the experimentally determined structures for the proteins where these existed. I then used UCSF ChimeraX (https://www.cgl.ucsf.edu/chimerax/) to render each protein model with hydrophobicity patches.

Molecular dynamics simulations

All MD simulations and analyses were executed on a desktop computer with Intel Xeon Platinum 8352 V Processor, 256 GB RAM, and 2 NVIDIA GPUs (GeForce RTX 4090) with 24 GB VRAM each. All MD simulations were conducted using GROMACS 2024.5 (Abraham et al., Reference Abraham, Murtola, Schulz, Páll, Smith, Hess and Lindahl2015) with the CHARMM36m all-atom force field (Huang et al., Reference Huang, Rauscher, Nawrocki, Ran, Feig, de Groot, Grubmüller and MacKerell2017). Data for retinal were obtained from NAMD Wiki (https://www.ks.uiuc.edu/Research/namd/wiki/index.cgi?RetinalTop for topology and https://www.ks.uiuc.edu/Research/namd/wiki/index.cgi?RetinalPar for parameters) and manually integrated into the protein file. Initial structures, configuration files, and command-line codes for the simulations are publicly available on Zenodo (https://doi.org/10.5281/zenodo.15377505).

Regarding native OPN2, the protein membrane system was constructed using the web-based membrane builder CHARMM-GUI and downloaded in GROMACS format (Jo et al., Reference Jo, Kim, Iyer and Im2008; Lee et al., Reference Lee, Cheng, Swails, Yeom, Eastman, Lemkul, Wei, Buckner, Jeong, Qi, Jo, Pande, Case, Brooks, MacKerell, Klauda and Im2016; Wu et al., Reference Wu, Cheng, Jo, Rui, Song, Dávila-Contreras, Qi, Lee, Monje-Galvan, Venable, Klauda and Im2014). The protein was centered in a rectangular box. The generated membrane models consisted of 40% POPC, 40% POPE, 10% POPS, and 10% cholesterol, which simulated a rod photoreceptor disk membrane where native OPN2 is usually expressed (Albert and Boesze-Battaglia, Reference Albert and Boesze-Battaglia2005). The system was solvated in TIP3P water with 150 mM NaCl. NaCl was used instead of KCl to simulate the environment of rod photoreceptor outer segments (Govardovskii, Reference Govardovskii1971). 11-cis-Retinal was manually added to the protein files. LINCS constraints were used for constraints, and the Verlet integrator was used. Electrostatics was handled with particle mesh Ewald (PME), with both Coulomb and van der Waals interaction cutoffs set at 1.2 nm. The energy of the system was minimized using the steepest descent until the maximum forces converged below 1000 kJ/mol/nm. The standard six-step CHARMM-GUI NP $ \gamma $ T equilibration protocol (Jo et al., Reference Jo, Kim, Iyer and Im2008) was used, with two 125-ps NVT equilibration simulations, and one 125-ps and three 500-ps NP $ \gamma $ T equilibration simulations. Temperature and pressure were maintained at 303.15 K and 1.0 bar, respectively, using the V-rescale thermostat and C-rescale barostat with surface tension coupling. Following equilibration, a 10-ns production MD simulation was run. The retinal molecule was then manually switched to the all-trans state by rotating C13 to C15 and associated hydrogen and methyl groups by 180 degrees around the C11–C12 bond. This mimicked the effect of the absorption of a photon by 11-cis-retinal. Energy minimization and equilibration were repeated exactly as before, and then a 120-ns production MD simulation was run.

Regarding QTY-designed OPN2, I found that helix 9 of the QTY analogue flung up and adhered to the surface that was originally the transmembrane domain, which disrupted the normal conformation. Therefore, this helix was truncated. The system was constructed directly using GROMACS. The protein was centered in a rectangular box with dimensions equal to those of native OPN2. The equilibration involved a sequence of simulations: one 250-ps NVT equilibration, followed by one 125-ps and one 1500-ps NPT equilibration, totaling an equilibration duration that matches that of the natural OPN2. All other configurations and parameters were identical to those of the original OPN2.

After the simulations, I extracted frames from the trajectory to inspect the structures before and after retinal isomerization. I also calculated the RMSD (‘gmx rms’ command), number of hydrogen bonds (‘gmx hbond’ command) and water molecules (‘gmx select’ command) inside the binding pocket, interaction energy (including short-range Coulombic interaction energy and short-range Lennard-Jones energy) (‘gmx energy’ command), the RMSF (‘gmx rmsf’ command), and radius of gyration (‘gmx gyrate’ command). The detailed commands are publicly available in the online database for this study.

Results and discussion

The rationale of the QTY code

The transmembrane segments of membrane proteins are hydrophobic in nature. This leads to a challenge to study their structure and function. Traditionally, detergent must be used to purify them. Zhang et al. considered a different approach, that is, via systematic soluble protein design. There exist structural similarities, as can be observed on high-resolution electron density maps, between certain hydrophobic and hydrophilic amino acids: leucine (L) and glutamine (Q), isoleucine (I) or valine (V) and threonine (T), and phenylalanine (F) with tyrosine (Y). This fact enables systematic replacement of L with Q, I/V with T, and F with Y in all transmembrane segments of the membrane protein, which Zhang et al. named the QTY code. QTY analogues are less hydrophobic than native membrane proteins. Although their amino acid sequences are significantly changed, they still exhibit relatively preserved structure, isoelectric points (pI), and molecular weights (MW) compared to the native membrane proteins (Table 1).

Table 1. Protein characteristics of 12 retinylidene proteins and their QTY analogs

MW, molecular weight; pI, isoelectric focusing; RMSD, root-mean-square distance; TM, transmembrane.

Protein sequence alignments and other characteristics of retinylidene proteins

The protein sequences of nine human opsins and three microbial opsins were aligned with their QTY analogues (Figure 1). The QTY substitution resulted in an overall variation of the sequence of 15.48% to 30.04% and a variation of the transmembrane domain of 35.53% to 50.24%. Despite changes in amino acid composition and sequence, the pI only experienced a slight change, between 0.00 and 0.16. This is because the amino acids Q, T, and Y have neutral charges and do not introduce additional charges to the system. The MW increased slightly by a value between 0.04 and 0.60 kDa. This is because the hydrophilic amino acids usually have nitrogen or oxygen atoms in place of carbon atoms in hydrophobic acids and thus may have higher mass: L (131.17 Da) versus Q (146.14 Da); I (131.17 Da) and V (117.15 Da) versus T (119.12 Da); F (165.19 Da) versus Y (181.19 Da).

Figure 1. The protein sequence alignments of 12 retinylidene proteins and their QTY analogs. Blue tables: human opsins; purple tables: microbial opsins. The symbols $ \mid $ and $ \ast $ indicate that amino acids are identical or different, respectively. Amino acids L, I/V, and F in TM (transmembrane) alpha helices (shown in blue above the sequences) are replaced with Q, T, and Y, respectively. The variation in the TM domain ranges from 35.53% to 50.24%, while the overall variation rate ranges from 15.48% to 30.04%. The characteristics of the proteins are shown above the sequences. Despite large variation rates, the pI only experiences a slight change between 0.00 and 0.16 and the MW increases slightly by a value between 0.04 and 0.60 kDa. The enlarged individual sequence alignments are available in supplementary information. The alignments are (a) OPN1MW versus OPN1MW $ {}^{\mathrm{QTY}} $ , (b) OPN1LW versus OPN1LW $ {}^{\mathrm{QTY}} $ , (c) OPN1SW versus OPN1SW $ {}^{\mathrm{QTY}} $ , (d) OPN2 versus OPN2 $ {}^{\mathrm{QTY}} $ , (e) OPN3 versus OPN3 $ {}^{\mathrm{QTY}} $ , (f) OPN4 versus OPN4 $ {}^{\mathrm{QTY}} $ , (g) OPN5 versus OPN5 $ {}^{\mathrm{QTY}} $ , (h) RGR versus RGR $ {}^{\mathrm{QTY}} $ , (i) RRH versus RRH $ {}^{\mathrm{QTY}} $ , (j) BACR versus BACR $ {}^{\mathrm{QTY}} $ , (k) BACH versus BACH $ {}^{\mathrm{QTY}} $ , and (l) ChR2 versus ChR2 $ {}^{\mathrm{QTY}} $ .

Superpositions of AlphaFold3-predicted native human opsins, their water-soluble QTY variants, and experimentally determined structures

Although the chromophore, retinal, is bound to retinylidene proteins in an internal pocket, it is only incorporated into the protein after protein folding is complete. In other words, protein folding does not depend on the chromophore. Thus, it is justified to predict the ligand-free forms of the proteins. I used AlphaFold3 to predict the structure of the nine native human opsins and their water-soluble QTY analogues. The structures superposed very well (Figure 2ai). For OPN2, which has an available X-ray diffraction structure, I also superposed the experimentally determined structure with the AlphaFold3-predicted structures. Overall, the root mean square distances (RMSD) were small, from 0.307 to 0.611 Å, with only one exception of OPN2 $ {}^{\mathrm{QTY}} $ versus OPN2 $ {}^{\mathrm{EXP}} $ (RMSD = 0.999 Å). This shows that the water-soluble QTY analogues are quite similar to native proteins in terms of structure.

Figure 2. Superposition of AlphaFold3-predicted native human retinylidene proteins, their QTY analogs, and experimentally determined structures. For clarity, unstructured N- and C-terminal ends are deleted. For (a) to (i), despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, from 0.307 to 0.611 Å, with only one exception (OPN2 $ {}^{\mathrm{QTY}} $ vs. OPN2 $ {}^{\mathrm{EXP}} $ , RMSD = 0.999 Å). Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) OPN1MW $ {}^{\mathrm{AF}3} $ versus OPN1MW $ {}^{\mathrm{QTY}} $ , (b) OPN1LW $ {}^{\mathrm{AF}3} $ versus OPN1LW $ {}^{\mathrm{QTY}} $ , (c) OPN1SW $ {}^{\mathrm{AF}3} $ versus OPN1SW $ {}^{\mathrm{QTY}} $ , (d) OPN2 $ {}^{\mathrm{AF}3} $ versus OPN2 $ {}^{\mathrm{QTY}} $ versus OPN2 $ {}^{\mathrm{EXP}} $ , (e) OPN3 $ {}^{\mathrm{AF}3} $ versus OPN3 $ {}^{\mathrm{QTY}} $ , (f) OPN4 $ {}^{\mathrm{AF}3} $ versus OPN4 $ {}^{\mathrm{QTY}} $ , (g) OPN5 $ {}^{\mathrm{AF}3} $ versus OPN5 $ {}^{\mathrm{QTY}} $ , (h) RGR $ {}^{\mathrm{AF}3} $ versus RGR $ {}^{\mathrm{QTY}} $ , and (i) RRH $ {}^{\mathrm{AF}3} $ versus RRH $ {}^{\mathrm{QTY}} $ . For (j) and (k), there is a large degree of similarity between the RMSD between a pair of native proteins and that between the corresponding pair of QTY analogs. Green: OPN1MW; red: OPN1LW; blue: OPN1SW; purple: OPN2; cyan: OPN3; gray: OPN4; olive: OPN5; orange: RGR; pink: RRH. The superpositions are (j) OPN1MW $ {}^{\mathrm{AF}3} $ versus OPN1LW $ {}^{\mathrm{AF}3} $ versus OPN1SW $ {}^{\mathrm{AF}3} $ versus OPN2 $ {}^{\mathrm{AF}3} $ versus OPN3 $ {}^{\mathrm{AF}3} $ versus OPN4 $ {}^{\mathrm{AF}3} $ versus OPN5 $ {}^{\mathrm{AF}3} $ versus RGR $ {}^{\mathrm{AF}3} $ versus RRH $ {}^{\mathrm{AF}3} $ and (k) OPN1MW $ {}^{\mathrm{QTY}} $ versus OPN1LW $ {}^{\mathrm{QTY}} $ versus OPN1SW $ {}^{\mathrm{QTY}} $ versus OPN2 $ {}^{\mathrm{QTY}} $ versus OPN3 $ {}^{\mathrm{QTY}} $ versus OPN4 $ {}^{\mathrm{QTY}} $ versus OPN5 $ {}^{\mathrm{QTY}} $ versus RGR $ {}^{\mathrm{QTY}} $ versus RRH $ {}^{\mathrm{QTY}} $ .

All nine of these human opsins belong to the same large family, that is, animal opsins. However, they belong to different subfamilies and have structural differences from each other. In order to investigate the extent to which the water-soluble QTY analogues preserve these differences, I also conducted pairwise superpositions within all nine native human opsins and within all nine QTY analogues of human opsins (Figure 2j and k). There was a large degree of similarity between the RMSD between a pair of native proteins and that between the corresponding pair of QTY analogues. Therefore, I conclude that the structural differences are relatively well preserved before and after the QTY substitution. However, whether the functional differences are preserved remains a problem to study.

Superpositions of AlphaFold3-predicted native microbial opsins, their water-soluble QTY variants, and experimentally determined structures

I used AlphaFold3 to predict the structure of monomers of the three native microbial opsins, their water-soluble QTY analogues. Since microbial opsins often have to form oligomers to be functional, I also predicted the structure of native and QTY variants of BACR trimers, BACH trimers, and ChR2 dimers. These predicted structures were also superposed with experimentally determined structures (X-ray diffraction or electron microscopy). The structures superposed very well (Figure 3). Overall, the root-mean-square distances (RMSD) were small, with the highest value being 0.685 Å. This shows that the water-soluble QTY analogues are likely capable of forming oligomers with structures similar to those of native proteins.

Figure 3. Superposition of AlphaFold3-predicted native microbial retinylidene proteins, their QTY analogs, and experimentally determined structures. Despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, with the highest being 0.685 Å. For clarity, unstructured N- and C-terminal ends are deleted. Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) BACR $ {}^{\mathrm{AF}3} $ versus BACR $ {}^{\mathrm{QTY}} $ versus BACR $ {}^{\mathrm{EXP}} $ monomer, (b) BACR $ {}^{\mathrm{AF}3} $ versus BACR $ {}^{\mathrm{QTY}} $ versus BACR $ {}^{\mathrm{EXP}} $ trimer, (c) BACH $ {}^{\mathrm{AF}3} $ versus BACH $ {}^{\mathrm{QTY}} $ versus BACH $ {}^{\mathrm{EXP}} $ monomer, (d) BACH $ {}^{\mathrm{AF}3} $ versus BACH $ {}^{\mathrm{QTY}} $ versus BACH $ {}^{\mathrm{EXP}} $ trimer, (e) ChR2 $ {}^{\mathrm{AF}3} $ versus ChR2 $ {}^{\mathrm{QTY}} $ versus ChR2 $ {}^{\mathrm{EXP}} $ monomer, and (f) ChR2 $ {}^{\mathrm{AF}3} $ versus ChR2 $ {}^{\mathrm{QTY}} $ versus ChR2 $ {}^{\mathrm{EXP}} $ dimer.

Analysis of the hydrophobic surface of native retinylidene proteins and their water-soluble QTY variants

As transmembrane proteins, retinylidene proteins require detergents to be separated from the lipid bilayer. The detergents act as an interface between the hydrophobic transmembrane domain of the protein and the surrounding water, thus solubilizing the protein. If these proteins are exposed in water without detergents, their hydrophobicity will induce them to aggregate and precipitate, with their structures and functions disrupted.

The hydrophobic portions of the protein surface are represented in yellow (Figure 4). In the native proteins, there are large hydrophobic patches in the transmembrane domains, which are embedded within the lipid bilayer. The nonpolar and hydrophobic amino acids, including leucine (L), isoleucine (I), valine (V), phenylalanine (F), alanine (A), methionine (M), and tryptophan (W), interact with the hydrophobic hydrocarbon chains and expel water.

Figure 4. Hydrophobic surface of 12 retinylidene proteins and their water-soluble QTY analogs. Hydrophobic patches are shown in yellow, while hydrophilic patches are shown in cyan. The native proteins have many hydrophobic patches due to the presence of hydrophobic amino acids, including L, I, V, and F. After QTY substitution, hydrophilic Q, T, and Y have respectively replaced hydrophobic L, I/V, and F, and the hydrophobic patches in the surface of transmembrane helices have become more hydrophilic. In addition, the surface shape of the native and QTY analogs are very similar. For clarity, unstructured N- and C-terminal ends are deleted. The comparisons are (a) OPN1MW versus OPN1MW $ {}^{\mathrm{QTY}} $ , (b) OPN1LW versus OPN1LW $ {}^{\mathrm{QTY}} $ , (c) OPN1SW versus OPN1SW $ {}^{\mathrm{QTY}} $ , (d) OPN2 versus OPN2 $ {}^{\mathrm{QTY}} $ , (e) OPN3 versus OPN3 $ {}^{\mathrm{QTY}} $ , (f) OPN4 versus OPN4 $ {}^{\mathrm{QTY}} $ , (g) OPN5 versus OPN5 $ {}^{\mathrm{QTY}} $ , (h) RGR versus RGR $ {}^{\mathrm{QTY}} $ , (i) RRH versus RRH $ {}^{\mathrm{QTY}} $ , (j) BACR versus BACR $ {}^{\mathrm{QTY}} $ monomer, (k) BACH versus BACH $ {}^{\mathrm{QTY}} $ monomer, (l) ChR2 versus ChR2 $ {}^{\mathrm{QTY}} $ monomer, (m) BACR versus BACR $ {}^{\mathrm{QTY}} $ trimer, (n) BACH versus BACH $ {}^{\mathrm{QTY}} $ trimer, and (o) ChR2 versus ChR2 $ {}^{\mathrm{QTY}} $ dimer.

I investigated the surface hydrophobicity of water-soluble QTY analogues in comparison with the native proteins. After applying the QTY code to, respectively, replace the hydrophobic amino acids L, I/V, and F with hydrophilic amino acids glutamine (Q), threonine (T), and tyrosine (Y), the hydrophobic surface areas were significantly reduced. More importantly, since the electron density maps of the corresponding amino acids are similar, the alpha helices in QTY analogues had a similar contour to those in the native proteins. The QTY analogues successfully retained their structural integrity and stability.

AlphaFold3 predictions

DeepMind released AlphaFold3 in May 2024, marking a significant leap in accuracy for modeling across biomolecular space. This latest iteration outperforms state-of-the-art docking tools and its predecessor, AlphaFold-Multimer v.2.3, in protein structure and protein–protein interaction predictions (Abramson et al., Reference Abramson, Adler, Dunger, Evans, Green, Pritzel, Ronneberger, Willmore, Ballard, Bambrick, Bodenstein, Evans, Hung, O’Neill, Reiman, Tunyasuvunakool, Wu, Žemgulytė, Arvaniti, Beattie, Bertolli, Bridgland, Cherepanov, Congreve, Cowen-Rivers, Cowie, Figurnov, Fuchs, Gladman, Jain, Khan, Low, Perlin, Potapenko, Savy, Singh, Stecula, Thillaisundaram, Tong, Yakneen, Zhong, Zielinski, Žídek, Bapst, Kohli, Jaderberg, Hassabis and Jumper2024). AlphaFold3 reduces the reliance on multiple sequence alignment by integrating a diffusion-based model, enabling it to predict a broader spectrum of biomolecules, including ligands, ions, nucleic acids, modified residues, and large protein megacomplexes. On October 9, 2024, DeepMind’s founders, Demis Hassabis and John Jumper, received the Nobel Prize in Chemistry for revolutionizing protein structure prediction through leveraging machine learning.

AlphaFold3 is easily accessible online (https://alphafoldserver.com), allowing users to make 30 predictions a day at present. The structures of the QTY analogues were predicted using the AlphaFold3 server, which was free of charge, and the results were produced within a few minutes.

DeepMind also collaborated with the EBI to make over 214 million predicted protein structures available through the AlphaFold Protein Structure Database (https://alphafold.ebi.ac.uk). This number is continuously expanding, with the quality of predictions further improving with the advent of AlphaFold 3.

However, AlphaFold3 still has limitations that have been encountered in this study. Although AlphaFold3 allows the structural prediction of complexes that include small-molecule ligands, it only supports a small range of ligands, which does not include retinal. I was therefore unable to predict the structure of chromophore-bound states of retinylidene proteins with AlphaFold3. Consequently, I chose another approach, namely molecular dynamics simulation, in order to investigate the behavior of QTY opsin analogues when they are bound to the chromophore.

Simulation of 11-cis to all-trans isomerization of retinal in native OPN2 and its QTY analogue

QTY substitutions do not introduce changes to the essential NPxxY motif or the retinal-binding lysine of human retinylidene proteins. Therefore, I proposed that QTY analogues of human opsins may have conserved functions and carried out a molecular dynamics (MD) simulation. I recorded and analyzed the behavior of native OPN2 and its QTY analogue before and after the 11-cis to all-trans isomerization. Note that my aim is not to simulate the whole photocycle of OPN2, which takes longer than 1 ms (Fanelli et al., Reference Fanelli, Felline and Marigo2021), but rather to find proof that the QTY analogue conserves the function of being activated by isomerization of retinal.

The MD simulation provided evidence of conformational change in both native and QTY analogues (Figure 5). The native protein experienced a slower change from its original conformation, while the QTY analogue showed more fluctuations and sudden changes. Under further inspection, this phenomenon might be due to the entrance of water molecules into the QTY analogue. In addition, the retinal molecule had a greater degree of change in conformation in the native protein than in the QTY analogue. Nevertheless, the retinal in the QTY analogue was still able to cause changes to surrounding residues, sometimes doing so via intervening water molecules. It may be noted that the RMSD was still fluctuating, which means that the protein had not yet stabilized. This was expected, since the OPN2 was still in the process of transition from BATH to LUMI state, and this provided evidence for the activation of the protein in response to isomerization of retinal.

Figure 5. The conformational changes of native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore, retinal. (a, b) 1 ns running averages of the root-mean-square distances (RMSD) of the protein–retinal complex, transmembrane helix 6 (TM6), the retinal-binding pocket, and retinal. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. (c) Superpositions between cis-state OPN2, trans-state OPN2, cis-state OPN2 $ {}^{\mathrm{QTY}} $ , and trans-state OPN2 $ {}^{\mathrm{QTY}} $ . Both OPN2 and OPN2 $ {}^{\mathrm{QTY}} $ exhibit conformational changes, with RMSDs greater than 2 Å. Blue: cis-state protein, orange: 11-cis-retinal; yellow: trans-state protein; greenish cyan: all-trans retinal.

Zooming in on the retinal-binding pocket, I observed that retinal is held within its binding pocket by several forces in both the native and QTY analogue: the counterions of the retinal Schiff base (RSB) (113E and 181E), the hydrophobic interactions with neighboring residues (e.g., 265W, 268Y, and the 187–189 beta strand), and, sometimes, steric collisions (Figure 6). Since retinal is a ligand that binds in an internal pocket, I was unable to calculate the Gibbs free energy of its binding. Nevertheless, the distributions of interaction energies were found to be similar in native OPN2 and the water-soluble QTY analogue, except for small differences: native OPN2 experienced a change in interaction energy at about $ t=50 $ ns due to the entrance of water near 113E; the QTY analogue had more negative interaction energy at residues 208 and 212 due to phenylalanine (F) to tyrosine (Y) substitutions. Aside from these, the similarity was a strong suggestion of functional conservation. Finally, I observed very different hydrogen bonding and water molecule patterns inside the binding pocket, which was mainly due to several F to Y substitutions in that region. This might cause retinal to interact with adjacent residues in a subtly different way, and might lead to a different absorption peak since the environment of the RSB, that is, the binding pocket, determines the spectral characteristic of opsins (Fenno et al., Reference Fenno, Yizhar and Deisseroth2011).

Figure 6. Changes in the retinal-binding pocket and protein–ligand interaction in native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore. (a, b) Close-ups of the binding pocket in cis-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Blue: protein residues, orange: 11-cis-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (c, d) Close-ups of the binding pocket in trans-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Yellow: protein residues, greenish cyan: all-trans-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (e, f) The interaction energies (IEs) between the protein, the binding pocket, individual residues, and retinal. IE is the sum of the short-range Coulombic interaction energy and short-range Lennard–Jones energy. Note that IE is a product of MD simulation and is not necessarily a ‘real’ physical quantity. For clarity, IE is rescaled using the signed pseudo logarithm ( $ y=\operatorname{sign}(x)\cdot \ln \left(|x|+1\right) $ ). By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. The large changes in IE in OPN2 around $ t=50 $ ns are due to the entrance of water molecules into the binding pocket, near 113E. The QTY analog has more negative IE at residues 208 and 212 due to F to Y substitutions. Besides from these, the similarity between the IE of OPN2 and OPN2 $ {}^{\mathrm{QTY}} $ is a strong suggestion of functional conservation. (g, h) The number of hydrogen bonds formed between residues in the binding pocket, and the number of water molecules within the binding pocket. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line.

Overall, I observed that QTY-designed OPN2 had relatively similar behavior to the native variant despite certain small differences caused by decreased hydrophobicity. The function of OPN2 $ {}^{\mathrm{QTY}} $ awaits to be verified by experiments.

Future scopes and potential applications

The findings of this study indicate that the QTY code could be used as a robust tool to design water-soluble retinylidene proteins.

This could, in the first place, facilitate the study of these proteins. The water solubility of QTY analogues makes it easier to purify and investigate proteins, especially those such as RGR and RRH, which have not been studied much but may have considerable clinical relevance. The water solubility also reduces the difficulty in recording the different phases of the photocycle, in comparison with native membrane opsins. Another application may be the rapid design of new optogenetic tools.

Furthermore, water-soluble QTY opsins could be useful clinically, since they could be delivered and/or expressed in the eye without forming aggregates and precipitating. It is not unreasonable to hypothesize that QTY opsins can still pass through the photocycle and interact with downstream signaling proteins. In light of this, QTY opsins may facilitate the optogenetic therapy of ophthalmological diseases (Sakai et al., Reference Sakai, Tomita and Maeda2022), providing a potential pathway in restoring basic vision for those who have lost it.

Finally, water-soluble opsins have the potential to be harvested in mass and may be used to design new biomimetic light-sensing systems, which may have applications in bioengineering.

Conclusion

In this study, I selected 12 retinylidene proteins, including 9 human opsins (OPN1MW, OPN1LW, OPN1SW, OPN2, OPN3, OPN4, OPN5, RGR, and RRH) and 3 microbial opsins (BACR, BACH, and ChR2), that had clinical implications and various potential applications. I applied the QTY code to convert the hydrophobic transmembrane alpha helices into hydrophilic ones and thus create water-soluble QTY analogues of the proteins. Then, I used AlphaFold3 to predict the structures of the native proteins as well as the QTY analogues, superposed them, and found that the structure of the proteins was well conserved despite substantial residue substitutions. I also inspected the surface hydrophobicity of the proteins and found a great decrease in hydrophobic patches on the transmembrane surface. Next, I chose a representative retinylidene protein, OPN2, and employed molecular dynamics simulations to investigate the behavior of the native and QTY analogue of OPN2. I found that the QTY analogue was capable of conformational change and effective protein–ligand interaction. These findings revealed that QTY-designed OPN2 and retinylidene proteins in general might have conserved structure and function. I believe that these water-soluble QTY variants may have potential in research, therapeutic treatments, and bioengineering.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/qrd.2025.10009.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/qrd.2025.10009.

Data availability statement

The data for the AlphaFold3-predicted structures and MD simulation setting files and commands can be found on Zenodo (https://doi.org/10.5281/zenodo.15377505).

Acknowledgements

S.P. thanks Prof. Shuguang Zhang of the Lab of Molecular Architecture, Media Lab, Massachusetts Institute of Technology, for inspiring the topic of this study as well as providing valuable advice on research methods and manuscript preparation.

Author contribution

Conceptualization, formal analysis, investigation, methodology, validation, data curation, writing—original draft preparation, review and editing: S.P.

Financial support

There is no financial support for this digital bioinformatics study. I only used free tools that are publicly available.

Competing interests

Massachusetts Institute of Technology (MIT) filed several patent applications for the QTY code for GPCRs, which do not include the human opsins. There are no competing interests.

Ethics statement

All methods were carried out in accordance with relevant guidelines and regulations. Neither human biological samples nor human subjects were used in this study. This is a completely digital structural bioinformatic study using the publicly available AlphaFold3 machine learning program and GROMACS molecular dynamics simulation program.

References

Abraham, MJ, Murtola, T, Schulz, R, Páll, S, Smith, JC, Hess, B and Lindahl, E (2015) Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 1925.Google Scholar
Abramson, J, Adler, J, Dunger, J, Evans, R, Green, T, Pritzel, A, Ronneberger, O, Willmore, L, Ballard, AJ, Bambrick, J, Bodenstein, SW, Evans, DA, Hung, C-C, O’Neill, M, Reiman, D, Tunyasuvunakool, K, Wu, Z, Žemgulytė, A, Arvaniti, E, Beattie, C, Bertolli, O, Bridgland, A, Cherepanov, A, Congreve, M, Cowen-Rivers, AI, Cowie, A, Figurnov, M, Fuchs, FB, Gladman, H, Jain, R, Khan, YA, Low, CMR, Perlin, K, Potapenko, A, Savy, P, Singh, S, Stecula, A, Thillaisundaram, A, Tong, C, Yakneen, S, Zhong, ED, Zielinski, M, Žídek, A, Bapst, V, Kohli, P, Jaderberg, M, Hassabis, D and Jumper, JM (2024) Accurate structure prediction of biomolecular interactions with alphafold 3. Nature 630(8016), 493500.Google Scholar
Albert, AD and Boesze-Battaglia, K (2005) The role of cholesterol in rod outer segment membranes. Progress in Lipid Research 44(2–3), 99124.Google Scholar
Baraas, RC, Hagen, LA, Dees, EW and Neitz, M (2012) Substitution of isoleucine for threonine at position 190 of s-opsin causes s-cone-function abnormalities. Vision Research 73, 19.Google Scholar
Berendsen, H, Van Der Spoel, D and Van Drunen, R (1995) Gromacs: A message-passing parallel molecular dynamics implementation. Computer Physics Communications 91(1–3), 4356.Google Scholar
Berson, DM, Dunn, FA and Takao, M (2002) Phototransduction by retinal ganglion cells that set the circadian clock. Science 295(5557), 10701073.Google Scholar
Blackshaw, S and Snyder, SH (1999) Encephalopsin: A novel mammalian extraretinal opsin discretely localized in the brain. The Journal of Neuroscience 19(10), 36813690.Google Scholar
Bowmaker, JK and Dartnall, HJ (1980) Visual pigments of rods and cones in a human retina. The Journal of Physiology 298(1), 501511.Google Scholar
Buhr, ED, Yue, WWS, Ren, X, Jiang, Z, Liao, H-WR, Mei, X, Vemaraju, S, Nguyen, M-T, Reed, RR, Lang, RA, Yau, K-W and Van Gelder, RN (2015) Neuropsin (opn5)-mediated photoentrainment of local circadian oscillators in mammalian retina and cornea. Proceedings of the National Academy of Sciences of the United States of America 112(42), 1309313098.Google Scholar
Chabre, M and Deterre, P (1989) Molecular mechanism of visual transduction. European Journal of Biochemistry 179(2), 255266.Google Scholar
Chen, E and Zhang, S (2025) Structural bioinformatic study of human mitochondrial respiratory integral membrane megacomplex and its alphafold3 predicted water-soluble qty megacomplex analog. QRB Discovery 6, e12.Google Scholar
Chen, E, Pan, E and Zhang, S (2025) Structure bioinformatics of six human integral transmembrane enzymes and their alphafold3 predicted water-soluble qty analogs: Insights into face1 and stea4 binding mechanisms. Pharmaceutical Research 42, 291305.Google Scholar
Cook, JD, Ng, SY, Lloyd, M, Eddington, S, Sun, H, Nathans, J, Bok, D, Radu, RA and Travis, GH (2017) Peropsin modulates transit of vitamin a from retina to retinal pigment epithelium. Journal of Biological Chemistry 292(52), 2140721416.Google Scholar
Copits, BA, Gowrishankar, R, O’Neill, PR, Li, JN, Girven, KS, Yoo, JJ, Meshik, X, Parker, KE, Spangler, SM, Elerding, AJ, Brown, BJ, Shirley, SE, Ma, KKL, Vasquez, AM, Stander, MC, Kalyanaraman, V, Vogt, SK, Samineni, VK, Patriarchi, T, Tian, L, Gautam, N, Sunahara, RK, Gereau, RW and Bruchas, M R (2021) A photoswitchable gpcr-based opsin for presynaptic inhibition. Neuron 109(11), 17911809.e11.Google Scholar
Fanelli, F, Felline, A and Marigo, V (2021) Structural aspects of rod opsin and their implication in genetic diseases. Pflügers Archiv - European Journal of Physiology 473(9), 13391359.Google Scholar
Fenno, L, Yizhar, O and Deisseroth, K (2011) The development and application of optogenetics. Annual Review of Neuroscience 34, 389412.Google Scholar
Findlay, JB and Pappin, DJ (1986) The opsin family of proteins. Biochemical Journal 238(3), 625642.Google Scholar
Gardner, JC, Webb, TR, Kanuga, N, Robson, AG, Holder, GE, Stockman, A, Ripamonti, C, Ebenezer, ND, Ogun, O, Devery, S, Wright, GA, Maher, ER, Cheetham, ME, Moore, AT, Michaelides, M and Hardcastle, AJ (2010) X-linked cone dystrophy caused by mutation of the red and green cone opsins. The American Journal of Human Genetics 87(1), 2639.Google Scholar
Gmelin, W, Zeth, K, Efremov, R, Heberle, J, Tittor, J and Oesterhelt, D (2007) The crystal structure of the l1 intermediate of halorhodopsin at 1.9 Å resolution. Photochemistry and Photobiology 83(2), 369377.Google Scholar
Govardovskii, VI (1971) Sodium and potassium concentration gradient in the retinal rod outer segment. Nature New Biology 234(45), 5354.Google Scholar
Gühmann, M, Porter, ML and Bok, MJ (2022) The gluopsins: Opsins without the retinal binding lysine. Cells 11(15), 2441.Google Scholar
Han, X and Boyden, ES (2007) Multiple-color optical activation, silencing, and desynchronization of neural activity, with single-spike temporal resolution. PLoS One 2(3), e299.Google Scholar
Hao, S, Jin, D, Zhang, S and Qing, R (2020) Qty code-designed water-soluble fc-fusion cytokine receptors bind to their respective ligands. QRB Discovery 1, e4.Google Scholar
Huang, J, Rauscher, S, Nawrocki, G, Ran, T, Feig, M, de Groot, BL, Grubmüller, H and MacKerell, AD (2017) Charmm36m: An improved force field for folded and intrinsically disordered proteins. Nature Methods 14(1), 7173.Google Scholar
Hubbard, R and Kropf, A (1958) The action of light on rhodopsin. Proceedings of the National Academy of Sciences 44(2), 130139.Google Scholar
Jo, S, Kim, T, Iyer, VG and Im, W (2008) Charmm-gui: A web-based graphical user interface for charmm. Journal of Computational Chemistry 29(11), 18591865.Google Scholar
Johnsson, F, Karagöl, T, Karagöl, A and Zhang, S (2025) Structural bioinformatic study of six human olfactory receptors and their alphafold3 predicted water-soluble qty variants and or1a2 with an odorant octanoate and taar9 with spermidine. QRB Discovery 6, e2.Google Scholar
Jumper, J, Evans, R, Pritzel, A, Green, T, Figurnov, M, Ronneberger, O, Tunyasuvunakool, K, Bates, R, Žídek, A, Potapenko, A, Bridgland, A, Meyer, C, Kohl, SAA, Ballard, AJ, Cowie, A, Romera-Paredes, B, Nikolov, S, Jain, R, Adler, J, Back, T, Petersen, S, Reiman, D, Clancy, E, Zielinski, M, Steinegger, M, Pacholska, M, Berghammer, T, Bodenstein, S, Silver, D, Vinyals, O, Senior, AW, Kavukcuoglu, K, Kohli, P and Hassabis, D (2021) Highly accurate protein structure prediction with alphafold. Nature 596(7873), 583589.Google Scholar
Karagöl, A, Karagöl, T, Smorodina, E and Zhang, S (2024a) Structural bioinformatics studies of glutamate transporters and their alphafold2 predicted water-soluble qty variants and uncovering the natural mutations of l-q, i-t, f-y and q-l, t-i and y-f. PLoS One 19(4), e0289644.Google Scholar
Karagöl, A, Karagöl, T and Zhang, S (2024b) Molecular dynamic simulations reveal that water-soluble qty-variants of glutamate transporters eaa1, eaa2 and eaa3 retain the conformational characteristics of native transporters. Pharmaceutical Research 41(10), 19651977.Google Scholar
Karagöl, T, Karagöl, A and Zhang, S (2024c) Structural bioinformatics studies of serotonin, dopamine and norepinephrine transporters and their alphafold2 predicted water-soluble qty variants and uncovering the natural mutations of l-q, i-t, f-y and q-l, t-i and y-f. PLoS One 19(3), e0300340.Google Scholar
Lee, J, Cheng, X, Swails, JM, Yeom, MS, Eastman, PK, Lemkul, JA, Wei, S, Buckner, J, Jeong, JC, Qi, Y, Jo, S, Pande, VS, Case, DA, Brooks, CLI, MacKerell, ADJ, Klauda, JB and Im, W (2016) Charmm-gui input generator for namd, gromacs, amber, openmm, and charmm/openmm simulations using the charmm36 additive force field. Journal of Chemical Theory and Computation 12(1), 405413.Google Scholar
Li, M, Tang, H, Qing, R, Wang, Y, Liu, J, Wang, R, Lyu, S, Ma, L, Xu, P, Zhang, S and Tao, F (2024a) Design of a water-soluble transmembrane receptor kinase with intact molecular function by qty code. Nature Communications 15(1), 4293.Google Scholar
Li, M, Wang, Y, Tao, F, Xu, P and Zhang, S (2024b) Qty code designed antibodies for aggregation prevention: A structural bioinformatic and computational study. Proteins: Structure, Function, and Bioinformatics 92(2), 206218.Google Scholar
Mahalingam, M, Martínez-Mayorga, K, Brown, MF and Vogel, R (2008) Two protonation switches control rhodopsin activation in membranes. Proceedings of the National Academy of Sciences 105(46), 1779517800.Google Scholar
Nagel, G, Szellas, T, Huhn, W, Kateriya, S, Adeishvili, N, Berthold, P, Ollig, D, Hegemann, P and Bamberg, E (2003) Channelrhodopsin-2, a directly light-gated cation-selective membrane channel. Proceedings of the National Academy of Sciences 100(24), 1394013945.Google Scholar
Nathans, J and Hogness, DS (1984) Isolation and nucleotide sequence of the gene encoding human rhodopsin. Proceedings of the National Academy of Sciences of the United States of America 81(15), 48514855.Google Scholar
Nikolaev, A, Orlov, Y, Tsybrov, F, Kuznetsova, E, Shishkin, P, Kuzmin, A, Mikhailov, A, Nikolaeva, YS, Anuchina, A, Chizhov, I, Semenov, O, Kapranov, I, Borshchevskiy, V, Remeeva, A and Gushchin, I (2025) Engineering of soluble bacteriorhodopsin. Chemical Science 16(24), 1106711076.Google Scholar
Oesterhelt, D and Stoeckenius, W (1971) Rhodopsin-like protein from the purple membrane of Halobacterium halobium. Nature New Biology 233(39), 149152.Google Scholar
Okada, T, Ernst, OP, Palczewski, K and Hofmann, KP (2001) Activation of rhodopsin: New insights from structural and biochemical studies. Trends in Biochemical Sciences 26(5), 318324.Google Scholar
Palczeski, K, Behnke, CA, Motoshima, H and Fox, BA (2000) Crystal structure of rhodopsin: A g protein-coupled receptor. Science 289, 739745.Google Scholar
Pan, E, Tao, F, Smorodina, E and Zhang, S (2024) Structural bioinformatics studies of six human abc transporters and their alphafold2-predicted water-soluble qty variants. QRB Discovery 5, e1.Google Scholar
Provencio, I, Jiang, G, De Grip, WJ, Hayes, WP and Rollag, MD (1998) Melanopsin: An opsin in melanophores, brain, and eye. Proceedings of the National Academy of Sciences 95(1), 340345.Google Scholar
Qing, R, Han, Q, Skuhersky, M, Chung, H, Badr, M, Schubert, T and Zhang, S (2019) Qty code designed thermostable and water-soluble chimeric chemokine receptors with tunable ligand affinity. Proceedings of the National Academy of Sciences 116(51), 2566825676.Google Scholar
Qing, R, Hao, S, Smorodina, E, Jin, D, Zalevsky, A and Zhang, S (2022) Protein design: From the aspect of water solubility and stability. Chemical Reviews 122(18), 1408514179.Google Scholar
Qing, R, Xue, M, Zhao, J, Wu, L, Breitwieser, A, Smorodina, E, Schubert, T, Azzellino, G, Jin, D, Kong, J, Palacios, T, Sleytr, UB and Zhang, S (2023) Scalable biomimetic sensing system with membrane receptor dual-monolayer probe and graphene transistor arrays. Science Advances 9(29), eadf1402.Google Scholar
Radu, RA, Hu, J, Peng, J, Bok, D, Mata, NL and Travis, GH (2008) Retinal pigment epithelium-retinal g protein receptor-opsin mediates light-dependent translocation of all-trans-retinyl esters for synthesis of visual chromophore in retinal pigment epithelial cells. The Journal of Biological Chemistry 283(28), 1973019738.Google Scholar
Sajeev-Sheeja, A and Zhang, S (2024) Structural molecular modeling of bacterial integral membrane protein enzymes and their alphafold2 predicted water-soluble qty variants. Journal of Proteins and Proteomics 15(4), 635645.Google Scholar
Sajeev-Sheeja, A, Smorodina, E and Zhang, S (2023) Structural bioinformatics studies of bacterial outer membrane beta-barrel transporters and their alphafold2 predicted water-soluble qty variants. PLoS One 18(8), e0290360.Google Scholar
Sakai, D, Tomita, H and Maeda, A (2022) Optogenetic therapy for visual restoration. International Journal of Molecular Sciences 23(23), 15041.Google Scholar
Schobert, B and Lanyi, JK (1982) Halorhodopsin is a light-driven chloride pump. Journal of Biological Chemistry 257(17), 1030610313.Google Scholar
Shen, D, Jiang, M, Hao, W, Tao, L, Salazar, M and Fong, HK (1994) A human opsin-related gene that encodes a retinaldehyde-binding protein. Biochemistry 33(44), 1311713125.Google Scholar
Skuhersky, M, Tao, F, Qing, R, Smorodina, E, Jin, D and Zhang, S (2021) Comparing native crystal structures and alphafold2 predicted water-soluble g protein-coupled receptor qty variants. Life 11(12), 1285.Google Scholar
Smorodina, E, Tao, F, Qing, R, Jin, D, Yang, S and Zhang, S (2022) Comparing 2 crystal structures and 12 alphafold2-predicted human membrane glucose transporters and their water-soluble glutamine, threonine and tyrosine variants. QRB Discovery 3, e5.Google Scholar
Smorodina, E, Tao, F, Qing, R, Yang, S and Zhang, S (2024) Computational engineering of water-soluble human potassium ion channels through qty transformation. Scientific Reports 14(1), 28159.Google Scholar
Spudich, JL, Yang, C-S, Jung, K-H and Spudich, EN (2000) Retinylidene proteins: Structures and functions from archaea to humans. Annual Review of Cell and Developmental Biology 16, 365392.Google Scholar
Sun, H, Gilbert, DJ, Copeland, NG, Jenkins, NA and Nathans, J (1997) Peropsin, a novel visual pigment-like protein located in the apical microvilli of the retinal pigment epithelium. Proceedings of the National Academy of Sciences of the United States of America 94(18), 98939898.Google Scholar
Tao, F, Tang, H, Zhang, S, Li, M and Xu, P (2022) Enabling qty server for designing water-soluble alpha-helical transmembrane proteins. MBio 13(1), e03604e03621.Google Scholar
Tarttelin, EE, Bellingham, J, Hankins, MW, Foster, RG and Lucas, RJ (2003) Neuropsin (opn5): A novel opsin identified in mammalian neural tissue. FEBS Letters 554(3), 410416.Google Scholar
Tegler, L, Corin, K, Pick, H, Brookes, J, Skuhersky, M, Vogel, H and Zhang, S (2020) The g protein coupled receptor cxcr4 designed by the qty code becomes more hydrophilic and retains cell signaling activity. Scientific Reports 10(1), 21371.Google Scholar
Terakita, A (2005) The opsins. Genome Biology 6(3), 213.Google Scholar
Ueyama, H, Kuwayama, S, Imai, H, Tanabe, S, Oda, S, Nishida, Y, Wada, A, Shichida, Y and Yamade, S (2002) Novel missense mutations in red/green opsin genes in congenital color-vision deficiencies. Biochemical and Biophysical Research Communications 294(2), 205209.Google Scholar
Wissinger, B, Baumann, B, Buena-Atienza, E, Ravesh, Z, Cideciyan, AV, Stingl, K, Audo, I, Meunier, I, Bocquet, B, Traboulsi, EI, Hardcastle, AJ, Gardner, JC, Michaelides, M, Branham, KE, Rosenberg, T, Andreasson, S, Dollfus, H, Birch, D, Vincent, AL, Martorell, L, Català Mora, J, Kellner, U, Rüther, K, Lorenz, B, Preising, MN, Manfredini, E, Zarate, YA, Vijzelaar, R, Zrenner, E, Jacobson, SG and Kohl, S (2022) The landscape of submicroscopic structural variants at the opn1lw/opn1mw gene cluster on xq28 underlying blue cone monochromacy. Proceedings of the National Academy of Sciences 119(27), e2115538119.Google Scholar
Wu, EL, Cheng, X, Jo, S, Rui, H, Song, KC, Dávila-Contreras, EM, Qi, Y, Lee, J, Monje-Galvan, V, Venable, RM, Klauda, JB and Im, W (2014) Charmm-gui membrane builder toward realistic biological membrane simulations. Journal of Computational Chemistry 35(27), 19972004.Google Scholar
Yee, DC, Shlykov, MA, Vastermark, A, Reddy, VS, Arora, S, Sun, EI and Saier, MH (2013) The transporter–opsin– G protein-coupled receptor (tog) superfamily. The FEBS Journal 280(22), 57805800.Google Scholar
Zhang, F, Wang, L-P, Brauner, M, Liewald, JF, Kay, K, Watzke, N, Wood, PG, Bamberg, E, Nagel, G, Gottschalk, A and Deisseroth, K (2007) Multimodal fast optical interrogation of neural circuitry. Nature 446(7136), 633639.Google Scholar
Zhang, S and Egli, M (2022) Hiding in plain sight: Three chemically distinct alpha-helix types. Quarterly Reviews of Biophysics 55, e7.Google Scholar
Zhang, S, Tao, F, Qing, R, Tang, H, Skuhersky, M, Corin, K, Tegler, L, Wassie, A, Wassie, B, Kwon, Y, Suter, B, Entzian, C, Schubert, T, Yang, G, Labahn, J, Kubicek, J and Maertens, B (2018) Qty code enables design of detergent-free chemokine receptors that retain ligand-binding activities. Proceedings of the National Academy of Sciences 115(37) E8652E8659.Google Scholar
Zhou, Q, Yang, D, Wu, M, Guo, Y, Guo, W, Zhong, L, Cai, X, Dai, A, Jang, W, Shakhnovich, EI, Liu, Z-J, Stevens, RC, Lambert, NA, Babu, MM, Wang, M-W and Zhao, S (2019) Common activation mechanism of class a gpcrs. eLife 8, e50279.Google Scholar
Figure 0

Table 1. Protein characteristics of 12 retinylidene proteins and their QTY analogs

Figure 1

Figure 1. The protein sequence alignments of 12 retinylidene proteins and their QTY analogs. Blue tables: human opsins; purple tables: microbial opsins. The symbols $ \mid $ and $ \ast $ indicate that amino acids are identical or different, respectively. Amino acids L, I/V, and F in TM (transmembrane) alpha helices (shown in blue above the sequences) are replaced with Q, T, and Y, respectively. The variation in the TM domain ranges from 35.53% to 50.24%, while the overall variation rate ranges from 15.48% to 30.04%. The characteristics of the proteins are shown above the sequences. Despite large variation rates, the pI only experiences a slight change between 0.00 and 0.16 and the MW increases slightly by a value between 0.04 and 0.60 kDa. The enlarged individual sequence alignments are available in supplementary information. The alignments are (a) OPN1MW versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2 versus OPN2$ {}^{\mathrm{QTY}} $, (e) OPN3 versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4 versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5 versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR versus RGR$ {}^{\mathrm{QTY}} $, (i) RRH versus RRH$ {}^{\mathrm{QTY}} $, (j) BACR versus BACR$ {}^{\mathrm{QTY}} $, (k) BACH versus BACH$ {}^{\mathrm{QTY}} $, and (l) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $.

Figure 2

Figure 2. Superposition of AlphaFold3-predicted native human retinylidene proteins, their QTY analogs, and experimentally determined structures. For clarity, unstructured N- and C-terminal ends are deleted. For (a) to (i), despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, from 0.307 to 0.611 Å, with only one exception (OPN2$ {}^{\mathrm{QTY}} $ vs. OPN2$ {}^{\mathrm{EXP}} $, RMSD = 0.999 Å). Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) OPN1MW$ {}^{\mathrm{AF}3} $ versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW$ {}^{\mathrm{AF}3} $ versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW$ {}^{\mathrm{AF}3} $ versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2$ {}^{\mathrm{AF}3} $ versus OPN2$ {}^{\mathrm{QTY}} $ versus OPN2$ {}^{\mathrm{EXP}} $, (e) OPN3$ {}^{\mathrm{AF}3} $ versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4$ {}^{\mathrm{AF}3} $ versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5$ {}^{\mathrm{AF}3} $ versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR$ {}^{\mathrm{AF}3} $ versus RGR$ {}^{\mathrm{QTY}} $, and (i) RRH$ {}^{\mathrm{AF}3} $ versus RRH$ {}^{\mathrm{QTY}} $. For (j) and (k), there is a large degree of similarity between the RMSD between a pair of native proteins and that between the corresponding pair of QTY analogs. Green: OPN1MW; red: OPN1LW; blue: OPN1SW; purple: OPN2; cyan: OPN3; gray: OPN4; olive: OPN5; orange: RGR; pink: RRH. The superpositions are (j) OPN1MW$ {}^{\mathrm{AF}3} $ versus OPN1LW$ {}^{\mathrm{AF}3} $ versus OPN1SW$ {}^{\mathrm{AF}3} $ versus OPN2$ {}^{\mathrm{AF}3} $ versus OPN3$ {}^{\mathrm{AF}3} $ versus OPN4$ {}^{\mathrm{AF}3} $ versus OPN5$ {}^{\mathrm{AF}3} $ versus RGR$ {}^{\mathrm{AF}3} $ versus RRH$ {}^{\mathrm{AF}3} $ and (k) OPN1MW$ {}^{\mathrm{QTY}} $ versus OPN1LW$ {}^{\mathrm{QTY}} $ versus OPN1SW$ {}^{\mathrm{QTY}} $ versus OPN2$ {}^{\mathrm{QTY}} $ versus OPN3$ {}^{\mathrm{QTY}} $ versus OPN4$ {}^{\mathrm{QTY}} $ versus OPN5$ {}^{\mathrm{QTY}} $ versus RGR$ {}^{\mathrm{QTY}} $ versus RRH$ {}^{\mathrm{QTY}} $.

Figure 3

Figure 3. Superposition of AlphaFold3-predicted native microbial retinylidene proteins, their QTY analogs, and experimentally determined structures. Despite significant changes in the protein sequence, the structures superpose very well. The root-mean-square distance (RMSD) values are quite small, with the highest being 0.685 Å. For clarity, unstructured N- and C-terminal ends are deleted. Green: AlphaFold3-predicted native structure; cyan: AlphaFold3-predicted QTY analog structure; magenta: experimentally determined structure. The superpositions are (a) BACR$ {}^{\mathrm{AF}3} $ versus BACR$ {}^{\mathrm{QTY}} $ versus BACR$ {}^{\mathrm{EXP}} $ monomer, (b) BACR$ {}^{\mathrm{AF}3} $ versus BACR$ {}^{\mathrm{QTY}} $ versus BACR$ {}^{\mathrm{EXP}} $ trimer, (c) BACH$ {}^{\mathrm{AF}3} $ versus BACH$ {}^{\mathrm{QTY}} $ versus BACH$ {}^{\mathrm{EXP}} $ monomer, (d) BACH$ {}^{\mathrm{AF}3} $ versus BACH$ {}^{\mathrm{QTY}} $ versus BACH$ {}^{\mathrm{EXP}} $ trimer, (e) ChR2$ {}^{\mathrm{AF}3} $ versus ChR2$ {}^{\mathrm{QTY}} $ versus ChR2$ {}^{\mathrm{EXP}} $ monomer, and (f) ChR2$ {}^{\mathrm{AF}3} $ versus ChR2$ {}^{\mathrm{QTY}} $ versus ChR2$ {}^{\mathrm{EXP}} $ dimer.

Figure 4

Figure 4. Hydrophobic surface of 12 retinylidene proteins and their water-soluble QTY analogs. Hydrophobic patches are shown in yellow, while hydrophilic patches are shown in cyan. The native proteins have many hydrophobic patches due to the presence of hydrophobic amino acids, including L, I, V, and F. After QTY substitution, hydrophilic Q, T, and Y have respectively replaced hydrophobic L, I/V, and F, and the hydrophobic patches in the surface of transmembrane helices have become more hydrophilic. In addition, the surface shape of the native and QTY analogs are very similar. For clarity, unstructured N- and C-terminal ends are deleted. The comparisons are (a) OPN1MW versus OPN1MW$ {}^{\mathrm{QTY}} $, (b) OPN1LW versus OPN1LW$ {}^{\mathrm{QTY}} $, (c) OPN1SW versus OPN1SW$ {}^{\mathrm{QTY}} $, (d) OPN2 versus OPN2$ {}^{\mathrm{QTY}} $, (e) OPN3 versus OPN3$ {}^{\mathrm{QTY}} $, (f) OPN4 versus OPN4$ {}^{\mathrm{QTY}} $, (g) OPN5 versus OPN5$ {}^{\mathrm{QTY}} $, (h) RGR versus RGR$ {}^{\mathrm{QTY}} $, (i) RRH versus RRH$ {}^{\mathrm{QTY}} $, (j) BACR versus BACR$ {}^{\mathrm{QTY}} $ monomer, (k) BACH versus BACH$ {}^{\mathrm{QTY}} $ monomer, (l) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $ monomer, (m) BACR versus BACR$ {}^{\mathrm{QTY}} $ trimer, (n) BACH versus BACH$ {}^{\mathrm{QTY}} $ trimer, and (o) ChR2 versus ChR2$ {}^{\mathrm{QTY}} $ dimer.

Figure 5

Figure 5. The conformational changes of native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore, retinal. (a, b) 1 ns running averages of the root-mean-square distances (RMSD) of the protein–retinal complex, transmembrane helix 6 (TM6), the retinal-binding pocket, and retinal. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. (c) Superpositions between cis-state OPN2, trans-state OPN2, cis-state OPN2$ {}^{\mathrm{QTY}} $, and trans-state OPN2$ {}^{\mathrm{QTY}} $. Both OPN2 and OPN2$ {}^{\mathrm{QTY}} $ exhibit conformational changes, with RMSDs greater than 2 Å. Blue: cis-state protein, orange: 11-cis-retinal; yellow: trans-state protein; greenish cyan: all-trans retinal.

Figure 6

Figure 6. Changes in the retinal-binding pocket and protein–ligand interaction in native OPN2 and its QTY analog before and after 11-cis to all-trans isomerization of the chromophore. (a, b) Close-ups of the binding pocket in cis-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Blue: protein residues, orange: 11-cis-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (c, d) Close-ups of the binding pocket in trans-state. Protein–ligand interactions with lengths shorter or equal to 3.5 Å are shown in the figure. Yellow: protein residues, greenish cyan: all-trans-retinal; green dashed lines: ion bridge; yellow dashed lines: van der Waals and/or hydrophobic interactions. (e, f) The interaction energies (IEs) between the protein, the binding pocket, individual residues, and retinal. IE is the sum of the short-range Coulombic interaction energy and short-range Lennard–Jones energy. Note that IE is a product of MD simulation and is not necessarily a ‘real’ physical quantity. For clarity, IE is rescaled using the signed pseudo logarithm ($ y=\operatorname{sign}(x)\cdot \ln \left(|x|+1\right) $). By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line. The large changes in IE in OPN2 around $ t=50 $ ns are due to the entrance of water molecules into the binding pocket, near 113E. The QTY analog has more negative IE at residues 208 and 212 due to F to Y substitutions. Besides from these, the similarity between the IE of OPN2 and OPN2$ {}^{\mathrm{QTY}} $ is a strong suggestion of functional conservation. (g, h) The number of hydrogen bonds formed between residues in the binding pocket, and the number of water molecules within the binding pocket. By convention, the isomerization is set at time 0 ns, which is indicated by a brown, vertical dashed line.

Supplementary material: File

Pan supplementary material

Pan supplementary material
Download Pan supplementary material(File)
File 14.4 MB

Author comment: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR1

Comments

Dear Editor,

This study is part of a series of studies on protein design with the QTY code by Prof. Shuguang Zhang and his team. I extended QTY-design to a new family of proteins, the retinylidene proteins.

I completed this manuscript individually while maintaining close communication with Prof. Zhang. He has recommended this manuscript to Prof. Bengt Nordén, who has encouraged me to submit it to QRB Discovery.

Thank you very much for considering my work. I look forward to your response.

Sincerely,

Siqi Pan

Email: siqipan2008@outlook.com

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR2

Conflict of interest statement

N/A

Comments

This manuscript presents a bioinformatic study of the structure and properties of retinylidene proteins and their QTY-designed variants using AlphaFold3. Both the natural OPN2 and its QTY-designed variant were analyzed using AlphaFold3 and molecular dynamics simulations to investigate their responses to the isomerization of 11-cis-retinal to all-trans-retinal. The study is well-conducted and suitable for publication in QRB Discovery following revision.

1. The manuscript could provide more background and discussion on the downstream signaling pathways of OPN2. What is the potential significance for the treatment of related diseases and the underlying mechanisms of signal transduction?

2. The current MD analysis mainly focuses on structural changes, but it is recommended to include more targeted analysis of functionally relevant regions. Additionally, please clarify how the molecular dynamics simulation setup relates to real physiological processes.

3. In Figure 5, the RMSD of the protein and retinal in the QTY analog appears to be still fluctuating. If longer simulations are not feasible, it is recommended to include statistical analyses over different time windows to demonstrate whether the structure is being stabilized.

4. Please clarify the role and significance of the reverse-QTY design mentioned at the end of the introduction in the context of this study.

5. The first citation of Karagol, A. on page 8 contains garbled characters. This journal name of this citation is also not consistent with others. The citation in the third line of page 9 is incomplete. Please check all reference formats throughout the manuscript.

6. The uses of past and present tenses are not consistent. The manuscript would benefit from editing by a native speaker.

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

This study applies a water solubilization design method called QTY code to monoamine transporters, computationally characterizing some properties of the designed proteins and investigating the relationship between natural mutations in these monoamine transporters and the QTY code. These findings are intriguing and hold some significance for the study on monoamine transporters and the QTY code; however, there are some issues in the manuscript should be addressed by the authors.

Recommendation: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR4

Comments

No accompanying comment.

Decision: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R0/PR5

Comments

No accompanying comment.

Author comment: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR6

Comments

No accompanying comment.

Review: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR7

Conflict of interest statement

N/A.

Comments

The author resolved my questions and I think the manuscript is good for acceptance.

Recommendation: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR8

Comments

No accompanying comment.

Decision: A structural and functional bioinformatics study of QTY-designed retinylidene proteins — R1/PR9

Comments

No accompanying comment.