Increased layer 5 Martinotti cell excitation reduces pyramidal cell population plasticity and improves learned motor execution

Curation statements for this article:
  • Curated by eLife

    eLife logo

    eLife Assessment

    This valuable study addresses a critical question regarding the role of a subpopulation of cortical interneurons (Chrna2-expressing Martinotti cells) in motor learning and cortical dynamics. However, despite the inclusion of interesting behavioral and imaging data, significant limitations remain, even after revision, in the design of the motor learning task and the associated data analyses. As a result, the presented data are incomplete to support the central conclusions.

This article has been Reviewed by the following groups

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Abstract

During motor activity and motor learning, pyramidal cells in the motor cortex receive inputs from local interneurons as well as deeper structures. Layer 5 pyramidal cells in the primary motor cortex then feed commands to spinal circuits for motor execution. The genetic ablation of layer 5 Chrna2 Martinotti cells, which selectively target pyramidal tract pyramidal cells, resulted in disturbed fine motor functions. Using calcium imaging combined with chemogenetics, we show that activation of layer 5 Chrna2 Martinotti cells during training increases pyramidal cell tuning, changes responses temporal patterns and decreases assembly reconfiguration, while not affecting motor learning success rates. However, in mice that had already learned a reach-and-grasp (prehension) task, Chrna2 Martinotti cell activation resulted in improved prehension and increased power in low theta and high gamma bands of local field potentials in the motor cortex. This work indicates that activation of Chrna2 Martinotti cells reduces pyramidal cell assembly plasticity during learning, possibly facilitating already acquired motor skills.

Article activity feed

  1. eLife Assessment

    This valuable study addresses a critical question regarding the role of a subpopulation of cortical interneurons (Chrna2-expressing Martinotti cells) in motor learning and cortical dynamics. However, despite the inclusion of interesting behavioral and imaging data, significant limitations remain, even after revision, in the design of the motor learning task and the associated data analyses. As a result, the presented data are incomplete to support the central conclusions.

  2. Reviewer #1 (Public review):

    In this study, the authors investigated a specific subtype of SST-INs (layer 5 Chrna2-expressing Martinotti cells) and examined its functional role in motor learning.

    Most of the issues remain unaddressed. The findings across experiments are inconsistent, and it is unclear how the authors performed their analyses or why specific time points and comparisons were chosen. The study will require major re-analyzing and additional experiments to substantiate its conclusions.

    After reading the reviewers' responses, my major concerns about the manuscript remain unresolved, particularly regarding the arbitrarily defined stages of learning in the motor learning task and how the calcium imaging data align with the animal's movements.

    - In line 331, the authors refer to session 5 as "training," describing it as the final spoon session, and session 6 as "re-training," because it is the first session in which the pellet is presented on the plate rather than on the spoon. However, in Fig. 1F-H, even in the Ctrl group, it is clear that the performance drops significantly in session 5, which is supposed to be the easiest session before switching to the more difficult plate condition.

    - In the classic pellet-reaching task, the spoon sessions would typically be considered "shaping", while the plate sessions would represent the actual training phase. However, in this manuscript, the authors still insist on referring to session 2 as "learning" and session 5 as "training." I don't understand the difference between session 2 and session 5, especially when session 5's performance is lower than session 2 (even in Fig 1H when you compare succ ratio).

    - Since session 6 (on the plate) is considered as "retraining," why don't the authors present the behavioral results beyond session 6? As a result, it remains unclear whether the animals improved their performance during the retraining phase.

    - Lastly, in Fig. 4B the authors present only the success ratio and claim that performance improves with CLZ application. However, when comparing sessions 8-10 between the Ctrl and Cre⁺ groups, there already appears to be a baseline difference. CLZ treatment in Cre⁺ mice seem to bring performance only to the WT level rather than producing a clear improvement beyond baseline.

    - Regarding the alignment between imaging and behavior, the authors report ~100 prehensions per minute. However, the calcium imaging traces show fewer than 20-30 spikes over 150 seconds (~2.5 min; Fig. 1E). This discrepancy raises concerns about whether the authors can truly isolate calcium signals corresponding to individual prehension events (either successful ones or multiple combined events for unsuccessful attempts). The manuscript still does not present behavioral data that directly aligns prehension events with calcium imaging activity. Although the authors performed analyses suggesting that prehension-related activity does not systematically alter non-prehension epochs, this claim is difficult to evaluate without seeing the underlying traces. It is therefore unclear how the authors selected the example calcium traces aligned to prehension onset, given that there are more than 100 prehension events per minute.

    - In Fig. 1I, the authors also did not address why neural activity during successful trials is already lower one second before movement onset. The longer traces provided do not help to explain this observation or clarify the origin of this pre-movement reduction in activity. It actually further suggests that there may be some artifacts in the imaging that could affect the analysis.

    - Overall, because it remains difficult to understand exactly what the authors are analyzing (and because the definitions of the motor learning stages appear arbitrary) it is difficult to agree with the authors' conclusion that Ma2s cells reduce PyrN cell assembly plasticity during learning, thereby possibly facilitating already acquired motor skills.

  3. Reviewer #2 (Public review):

    Summary:

    In this manuscript, Malfatti et al. study the role of Chrna2 Martinotti cells (Mα2 cells), a subset of SST interneurons, for motor learning and motor cortex activity. The authors trained mice on a forelimb prehension task while recording neuronal activity of pyramidal cells using calcium imaging with a head mounted miniscope. While chemogenetically increasing Mα2 cell activity did not affect motor learning, it changed pyramidal cell activity such that activity peaks become sharper and differently timed than in control mice. Moreover, co-active neuronal assemblies become more stable with a smaller spatial distribution. Increasing Mα2 cell activity in previously trained mice did increase performance on the prehension task and led to increased theta and gamma band activity in the motor cortex. On the other hand, genetic ablation of Mα2 cells affected fine motor movements on a pasta handling task while not affecting the prehension task. While overall this study addresses an important and timely question, limitations in the design of the motor learning task and data analysis significantly weaken the conclusions drawn in this manuscript.

    Strengths:

    The proposed question of how Chrna2-expressing SST interneurons affect motor learning and motor cortex activity is important and timely. The study employs sophisticated approaches to record neuronal activity and manipulate the activity of a specific neuronal population in behaving mice over the course of motor learning. The authors analyze a variety of neuronal activity parameters, comparing different behavior trials, stages of learning, and the effects of Mα2 cell activation. The analysis of neuronal assembly activity and stability over the course of learning by tracking individual neurons throughout the imaging sessions is notable, since technically challenging, and yielded the interesting result that neuronal assemblies are more stable when activating Mα2 cells.

    Overall, the study provides compelling evidence that Mα2 cells regulate certain aspects of motor behaviors, likely by shaping circuit activity in the motor cortex.

    Weaknesses:

    While the authors addressed some of the concerns raised by the reviewers, several major limitations still exist in the revised manuscript.

    (1) I appreciate the authors now showing more measures of the prehension task (total reaches, success reaches/min, and success ratio) and providing more details on the task design. However, it is unclear why the authors chose a task design that is somewhat different from the commonly used approach. Here they increase the distance of the food pellet each session and are thus making the task increasingly harder, whereas commonly the target distance is kept stable (See 10.1038/nature08389 for example). The result is that important readouts of learning (e. g. success rate) thus remain stable, making it impossible to judge if learning has occurred, without a control group of non-trained mice. This makes it impossible to judge if the task is affected by increased Mα2 cell excitability, since there is no reference of how these measurements are supposed to change in a mouse that learns or doesn't learn the task.

    (2) Regarding the analysis of the calcium imaging data, it is still unclear why the authors cannot report a commonly used dF/F0 or z-score value, as recommended by both reviewers. The authors state the 1 sec time window prior to the prehension cannot be used as a baseline (F0), as there might be preparatory motor activity. In that case an even earlier window (such as -2 to -1sec) or z-scores should be used. The current version relabeling the background subtracted fluorescence signal as dF/F0 is misleading. Relatedly, it is unclear why the authors don't think the 1 sec window before prehension cannot be used as baseline, but at the same time use the difference in calcium activity before and after prehension onset as a cut-off criterion for defining cells as modulated during prehension and including in the analysis.

    (3) While the authors have improved their statistical reporting, key information is still missing in several places. For example, no N-numbers are listed in legends for figure 3, and there is no mention of the number of mice for analysis in figures 2 and 3. For clarity, the authors should also include the statistical test performed in the figure legends for any p-values shown in the figure.

  4. Author Response:

    The following is the authors’ response to the original reviews.

    eLife Assessment

    This valuable study addresses a critical and timely question regarding the role of a subpopulation of cortical interneurons (Chrna2-expressing Martinotti cells) in motor learning and cortical dynamics. However, while some of the behavior and imaging data are impressive, the small sample sizes and incomplete behavioral and activity analyses make interpretation difficult; therefore, they are insufficient to support the central conclusions. The study may be of interest to neuroscientists studying cortical neural circuits, motor learning, and motor control.

    We thank the reviewers and the editors for the insightful comments. We are pleased to report that the raised issues with the manuscript can be addressed by improving clarity in our writing of specific sections and by providing additional analysis. Specifically, it was not clear in the manuscript text that although we show illustrative data with a lower number of animals, our conclusions are supported by data with a larger and sufficient sample size. Also, the description of our control experiments has been improved to clarify our proper treatment controls. We therefore clarify below that our study presents compelling and sufficient evidence to support our conclusions. We have responded to all the comments, explaining how each concern has been addressed. All line and figure numbers mentioned here refer to the numbering of the reviewed manuscript version. All references are cited as DOIs.

    Reviewer #1 (Public review):

    There are many major issues with the study. The findings across experiments are inconsistent, and it is unclear how the authors performed their analyses or why specific time points and comparisons were chosen. The study requires major re-analysis and additional experiments to substantiate its conclusions.

    The main limitation of the study lies in its small sample sizes and the absence of key control experiments, which substantially weaken the strength of the conclusions.

    (1a) Behavior task - the pellet-reaching task is a well-established paradigm in the motor learning field. Why did the authors choose to quantify performance using "success pellets per minute" instead of the more conventional "success rate" (see PMID 19946267, 31901303, 34437845, 24805237)? It is also confusing that the authors describe sessions 1-5 as being performed on a spoon, while from session 6 onward, the pellets are presented on a plate. However, in lines 710-713, the authors define session 1 as "naive," session 2 as "learning," session 5 as "training," and "retraining" as a condition in which a more challenging pellet presentation was introduced. Does "naive session 1" refer to the first spoon session or to session 6 (when the food is presented on a plate)? The same ambiguity applies to "learning session 2," "training session 5," and so on. Furthermore, what criteria did the authors use to designate specific sessions as "learning" versus "training"? Are these definitions based on behavioral performance thresholds or some biological mechanisms? Clarifying these distinctions is essential for interpreting the behavioral results.

    We agree that success rate is a more conventional measure than the number of successful prehensions per minute. We have changed all behavior quantifications to success rate. Note that all behavioral conclusions drawn before are still valid under the new quantification (see Figures 1, 4, and 5). Importantly, the terms “learning,” “training,” and “retraining” were defined based on task structure and prior literature on motor learning stages rather than predetermined behavioral performance thresholds. These labels reflect progression through the task design (initial acquisition, continued practice under stable conditions, and adaptation to altered task demands), not biologically distinct or threshold-defined phases. We have revised the Methods section to make these definitions and transitions explicit to avoid ambiguity in interpreting the behavioral results.

    (1b) Judging from Figures 1F and 4B, even in WT mice, it is not convincing that the animals have actually learned the task. In all figures, the mice generally achieve 10-20 pellets per minute across sessions. The only sessions showing slightly higher performance are session 5 in Figure 1F ("train") and sessions 12 and 13 in Figure 4B ("CLZ"). In the classical pellet-reaching task, animals are typically trained for 10-12 sessions (approximately 60 trials per session, one session per day), and a clear performance improvement is observed over time. The authors should therefore present performance data for each individual session to determine whether there is any consistent improvement across days. As currently shown, performance appears largely unchanged across sessions, raising doubts about whether motor learning actually occurred.

    As described in the methods Single pellet prehension task section, in our setup box, the elevated plate slot for pellet delivery is at a challenging position, outside the slit and 2cm to the right, forcing the mice to use the left paw. Therefore, mice need to be trained in gradually harder positions, using a spoon to deliver the pellet instead of placing it directly at the plate slot. Due to the gradually increasing difficulty in the task, the success rate curve remains flat, while the total number of attempts and number of successful prehensions per minute increase (Figure 1 F-H). We therefore argue that motor learning indeed occurred, with a relatively constant success rate when performing a gradually harder task. Further, the success rate and number of successful prehensions of our mice is within levels previously reported for trained mice (10.3791/51238). We added the precise plate slot position in the methods section to make clearer the need of a gradually increasing difficulty delivery method.

    (1c) The authors also appear to neglect existing literature on the role of SST-INs in motor learning and local circuit plasticity (e.g., PMID 26098758, 36099920). Although the current study focuses on a specific subpopulation of SST-INs, the results reported here are entirely opposite to those of previous studies. The authors should, at a minimum, acknowledge these discrepancies and discuss potential reasons for the differing outcomes in the Discussion section.

    We thank the reviewer for pointing this out. It is by no means a neglect, but a careful balance discussing previous literature that can be fairly compared with our findings. It is becoming increasingly clear — with mounting evidence from modern transcriptomic and connectomic studies — that the canonical “three‑cardinal” interneuron populations (SST⁺, PV⁺, VIP⁺) represent oversimplified groupings that mask considerable heterogeneity. For example, in a comprehensive single-cell RNA‑sequencing (scRNA‑seq) study covering ~1.3 million cells from mouse cortex and hippocampus, the authors identified dozens of discrete GABAergic subtypes beyond the classical marker-defined classes, revealing continuous and graded variation in molecular identity across cortical and hippocampal regions (10.1016/j.cell.2021.04.021). Moreover, a recent study focusing on SST-expressing interneurons demonstrated that even within the SST class there are multiple subtypes with distinct laminar distributions, axonal projection patterns, and circuit connectivity — for instance, two different Martinotti subtypes vs. a non-Martinotti SST subtype targeting different pyramidal neuron types and dendritic compartments (10.1016/j.neuron.2023.05.032). Finally, developmental single‑cell transcriptomics shows that interneuron diversity is already apparent at early postmitotic stages, indicating that these subtypes are pre-specified rather than being mere activity‑dependent states (10.1038/s41467‑018‑07458‑1). These findings argue strongly that the traditional SST⁺ / PV⁺ / VIP⁺ classification, while useful as a coarse heuristic, fails to capture the rich diversity in molecular, morphological, and functional phenotypes that likely underlie distinct roles in circuit computation and behavior.

    The consequence of this is that studies using any of these three markers must be cautiously interpreted since in reality, several quite different neuronal populations are studied at once, especially if no efforts were made to tease out which of the participating populations (inside the “cardinal” population) contribute to the effects seen. Most likely, the reported results are based on a mixed population - in the worst case scenario - populations with opposite effects. In any case, we have now included the role of SST-INs in motor learning and M1 circuitry in the discussion section. We also respectfully disagree that our findings are the opposite of previous SST-IN studies. We show that increasing Ma2 excitability improved execution of an already learned movement, while 10.1038/nn.4049 showed that both activating (which is different from increasing excitability) and inhibiting SST-INs impaired the learning of a stereotyped movement. Similarly, 10.1016/j.neuron.2022.08.018 showed that increasing SST-INs excitability impairs motor learning, not execution of a previously learned movement. While we found that increasing excitability of Ma2 cells did not affect motor learning, note that the Ma2 are a subset of martinotti cells with homogeneous electrophysiological and morphological properties (10.1371/journal.pbio.2001392), and martinotti cells themselves are a subset of SST+ cells (10.1016/j.neuron.2023.05.032). The discussion has been updated to include this reasoning.

    (2a) Calcium imaging - The methodology for quantifying fluorescence changes is confusing and insufficiently described. The use of absolute dF values ("detrended by baseline subtraction," lines 565-567) for analyses that compare activity across cells and animals (e.g., Figure 1H) is highly unconventional and problematic. Calcium imaging is typically reported as dF/F0 or z-scores to account for large variations in baseline fluorescence (F0) due to differences in GCaMP expression, cell size, and imaging quality. Absolute dF values are uninterpretable without reference to baseline intensity - for example, a dF of 5 corresponds to a 100% change in a dim cell (F0 = 5) but only a 1% change in a bright cell (F0 = 500). This issue could confound all subsequent population-level analyses (e.g., mean or median activity) and across-group comparisons. Moreover, while some figures indicate that normalization was performed, the Methods section lacks any detailed description of how this normalization was implemented. The critical parameters used to define the baseline are also omitted. The authors should reprocess the imaging data using a standardized dF/F0 or z-score approach, explicitly define the baseline calculation procedure, and revise all related figures and statistical analyses accordingly.

    The calcium imaging used here is 1-photon microendoscopic video data. To our knowledge, it is not possible to extract the true cell baseline over time from 1-photon data, since the background component includes signals from multiple sources, and usually has fluctuations larger than the neural signal itself. We agree that absolute dF values cannot be compared across cells, and that is not what we report here. The CNMF-E algorithm outputs the temporal activity of each neuron with the background component already removed (10.7554/eLife.28728) and therefore the baseline subtraction used in our study is already standardized (10.7554/eLife.38173). Note that although it is common in the literature to record 1-photon data and perform similar preprocessing (some form of baseline subtraction and/or normalization by noise std), referring to the resulting trace as dF/F, that is not entirely correct, since true F0 extraction is not possible. We thus chose to refer to the resulting preprocessed traces as what they actually are - dF detrended (raw trace with estimated background components removed). However, we agree that a better description of the process would be helpful in our manuscript, and that the nomenclature might be confusing to readers. We therefore expanded the methods section to better explain that we will now refer to F0 as the background component (and refer to our resulting traces as dF/F) and explain how it was determined. We also updated the example traces in Figure 1E to now show the raw traces, the estimated background components and the detrended traces.

    (2b) Figure 1G - It is unclear why neural activity during successful trials is already lower one second before movement onset. Full traces with longer duration before and after movement onset should also be shown. Additionally, only data from "session 2 (learning)" and a single neuron are presented. The authors should present data across all sessions and multiple neurons to determine whether this observation is consistent and whether it depends on the stage of learning.

    We agree that it would be beneficial to show longer traces as an example of prehension-related activity, so we expanded Figure 1I to show a longer trace for a single neuron. We added to Supplemental Figure 2 plots showing longer traces from all sessions including all neurons for both genotypes.

    (2c) Figure 1H - The authors report that chemogenetic activation of Chrna2 cells induces differential changes in PyrN activity between successful and failed trials. However, one would expect that activating all Chrna2 cells would strongly suppress PyrN activity rather than amplifying the activity differences between trials. The authors should clarify the mechanism by which Chrna2 cell activation could exaggerate the divergence in PyrN responses between successful and failed trials. Perhaps, performing calcium imaging of Chrna2 cells themselves during successful versus failed trials would provide insight into their endogenous activity patterns and help interpret how their activation influences PyrN activity during successful and failed trials.

    The reviewer is correct to assume that increasing excitability of Ma2 cells would suppress PC activity. As shown in Supplemental Figure 2I, that is exactly what we observe when considering only non-prehension related activity. Thus, it is very interesting that the opposite effect is seen for prehension-related activity. Also, this finding perfectly aligns with our results from the assembly analysis showing that assembly activity is decreased within the prehension window compared to outside the prehension window. Unfortunately, imaging Ma2 cells would only add information to this study in understanding their influence on PCs if we image both populations simultaneously, which require equipment and reagents we do not currently have. Fortunately, however, the endogenous activity patterns of Ma2 cells and the direct connectivity between Ma2 and pyramidal cells was already previously investigated in detail (10.1371/journal.pbio.2001392), therefore we expanded the discussion to better explain that the differential changes in PC when increasing Ma2 excitability could be due to increased PC synchronization, since a single Ma2 connects to several PCs, and upon inhibition release all connected PCs fire synchronously.

    (2d) Figure 1H - Also, in general, the Cre+ (red) data points appear consistently higher in activity than the Cre- (black) points. This is counterintuitive, as activating Chrna2 cells should enhance inhibition and thereby reduce PyrN activity. The authors should clarify how Cre+ animals exhibit higher overall PyrN activity under a manipulation expected to suppress it. This discrepancy raises concerns about the interpretation of the chemogenetic activation effects and the underlying circuit logic.

    As explained above, increasing Ma2 excitability indeed decreased non-prehension related PC activity, and the proposed mechanism has been added to the discussion section. We also made

    clearer in the results section that we are referring to prehension-related PC activity, and emphasize that overall non-prehension related PC activity is decreased.

    (3) The statistical comparisons throughout the manuscript are confusing. In many cases, the authors appear to perform multiple comparisons only among the N, L, T, and R conditions within the WT group. However, the central goal of this study should be to assess differences between the WT and hM3D groups. In fact, it is unclear why the authors only provide p-values for some comparisons but not for the majority of the groups.

    We agree that a clearer description of the statistical analysis is warranted. We expanded the statistical analysis methods section to clarify, among other things, that all possible pairwise comparisons were performed and appropriately corrected for multiple comparisons, and only positive p-values are reported in the figures, therefore the absence of p-value for a comparison means that is not significant.

    (4a) Figure 4 - It is hard to understand why the authors introduce LFP experiments here, and the results are difficult to interpret in isolation. The authors should consider combining LFP recordings with calcium imaging (as in Figure 1) or, alternatively, repeating calcium imaging throughout the entire re-training period. This would provide a clearer link between circuit activity and behavior and strengthen the conclusions regarding Chrna2 cell function during re-training.

    Unfortunately, it is not possible in our setup to record calcium imaging and LFP simultaneously, since the implants needed for the miniscope occupy the entire space above the animal’s cranium. To record calcium imaging during the execution of learned movements is also impractical. If the animals were to be implanted before the training phase, the signal will likely be too degraded for recordings after the training sessions, since the miniscope signal quality decreases over time, and over successive miniscope attachments. If the animals were to be implanted between the training and retraining phase (as the LFP group), the gap between training and retraining would be even larger, at least 28 days (as opposed to 16 days for the LFP group), which would affect the performance in the task. Therefore, LFP recordings provide understanding of the higher-level changes happening in neural activity when excitation is increased in Ma2 cells during the execution of learned movements. We respectfully disagree that the results from the LFP group cannot be interpreted in isolation, since we found that mice with increased excitability of Ma2 cells display increased low theta and gamma power during the prehension movement. As discussed in the manuscript, the increased high gamma band power when Ma2 cells are overexcitable, particularly for the successful trials in the planning phase, suggest that Ma2 cells may have a role influencing theta and gamma oscillations during motor performance (lines 1348-1355).

    (4b) It is unclear why CLZ has no apparent effect in session 11, yet induces a large performance increase in sessions 12 and 13. Even then, the performance in sessions 12 and 13 (30 successful pellets) is roughly comparable to Session 5 in Figure 1F. Given this, it is questionable whether the authors can conclude that Chrna2 cell activation truly facilitates previously acquired motor skills?

    We understand that a source of confusion for the behavioral data in the LFP group was the absence of data from sessions 1-7, together with the missing explanation about the task changing from spoon to plate (as explained in answers to question 1a and 1b). Since the animals are getting pellets from the spoon in session 5 (easier) and from the plate in later sessions (harder), the fact that animals achieved the same performance in the plate as they had on the last spoon session indicates they relearned the movement. To further clarify the training development, we added the full set of sessions (1-13) to Supplemental Figure 7, indicating the spoon-to-plate switch after session 5 and the 16-days gap between sessions 7 and 8 (due to viral injection and electrodes implant surgeries).

    (5) Figure 5 - The authors report decreased performance in the pasta-handling task (presumably representing a newly learned skill) but observe no difference in the pellet-reaching task (presumably an already acquired skill). This appears to contradict the authors’ main claim that Chrna2 cell activation facilitates previously acquired motor skills.

    We respectfully disagree that the results for the pasta-handling conflict with the finding that increasing Ma2 excitability facilitates previously acquired movements. The pasta handling specifically measures forepaw dexterity (as outlined in lines 442-444), therefore assessing forelimb function unrelated to learning. Mice perform a set of stereotyped movements to manipulate the pasta, therefore no learning is required (note that animals were habituated to the arena, followed by a single test session, with no training sessions). We do specifically mention in the results section that "we used the pasta handling task to assess forepaw dexterity that does not require learning" (lines 1137-1139). Our findings support our reported conclusion that "Ma2 cells may have a role in orchestrating precise forelimb movements that do not require previous specific training" (lines 1154-1156).

    (6) Supplementary Figure 1 - The c-Fos staining appears unusually clean. Previous studies have shown that even in home-cage mice, there are substantial numbers of c-Fos+ cells in M1 under basal conditions (PMID 31901303, 31901303). Additionally, the authors should present Chrna2 cell labeling and c-Fos staining in separate channels. As currently shown, it is difficult to determine whether the c-Fos+ cells are truly Chrna2+ cells.

    Our c-Fos stain does work well after having improved this method in several of our projects. Unfortunately, we could not check the references mentioned in the comment, since it points to a study that did not mention c-Fos (maybe incorrect PMID code?). However, we found our images to have similar c-Fos levels in control as other studies (for example 10.3389/fnana.2014.00013 Figure 1A and 10.1109/TBME.2024.3401136 Supplemental Figure 2C). Thus, we do find background activity of c-Fos in both Cre+ and control mice, but the c-Fos stain appears clean because of the strong up-regulation and fluorescent signal in exogenously activated hM3Dq+ cells. Also, we noticed that the manuscript was missing a methods section for the c-Fos experiments, therefore we added a section detailing the hM3Dq activation validation (lines 487-498). Further, the figure now displays separate channels for hM3Dq + cells (magenta) and c-Fos (cyan) for better clarity.

    (7) Overall, the authors selectively report statistical comparisons only for findings that support their claims, while most other potentially informative comparisons are omitted. Complete and transparent reporting is necessary for proper interpretation of the data.

    As explained above (comment 3), we expanded the statistical description in the methods to explain that all possible pairwise comparisons were performed and appropriately corrected for multiple comparisons, and that omitted comparisons are non-significant.

    Reviewer #1 (Recommendations for the authors):

    (1) Figure legends - The authors should provide more detailed information in the figure legends, such as N values. It is also not explained what the bold bars, as well as the highest and lowest bars, represent. Clear labeling is essential for proper interpretation of the data.

    We revised all figure legends to add n-numbers for all quantification plots, and expanded the Statistical analysis methods section to explain the labeling of all quantifications.

    (2) Presentation of plots - The authors need to improve the clarity and completeness of their figure presentations. For example:

    (a) In Figure 1F, it is unclear whether the results were obtained under chemogenetic activation, as this information is missing from both the figure and the legend. Currently, it could be a comparison of Cre+ mice with Cre- mice without any manipulations.

    (b) In Figure 1H, p-values are reported, but it is not specified which groups are being compared. As mentioned above, why are p-values only given to some comparisons? Does that mean the others are not significant?

    (c) In Figure 1D, a scale bar should be provided.

    (d) In Figure 1E, the y-axis (fluorescence) scale should be clearly indicated.

    We thank the reviewer’s attention to the figure details. We added the missing scale bars for Figures 1D-E. We also clarified in the results section that all miniscope recordings were performed under clozapine treatment. As answered above (comments 3 and 7), we expanded the methods section to state that although all comparisons were made and appropriately corrected for multiple comparisons, only significant comparisons were reported. As for the groups being compared, every significance bar clearly connects two groups, which are the ones being compared. We also expanded the Statistical Analysis section to state that “Significance bars without ticks represent pairwise comparisons, while significance bars with downward ticks represent an effect.”.

    Reviewer #2 (Public review):

    The main limitation of the study lies in its small sample sizes and the absence of key control experiments, which substantially weaken the strength of the conclusions. Core findings of this paper, such as the lack of effect of Ma2 cell activation on motor learning, as well as the altered neuronal activity, rely on a sample size of n=3 mice per condition, which is likely underpowered to detect differences in behavior and contributes to the somewhat disconnected results on calcium activity, activity timing, and neuronal assembly activity.

    We understand that the source of confusion is the number of mice used for calcium imaging and the number of mice used for assessing the effect of Ma2 increased excitability in motor learning. The core finding that Ma2 increased excitability did not alter motor learning is supported by the data shown previously in Supplemental Figure 5 (now Figure 1F-H), with n=6 Cre+ and n=7 controls, which has enough statistical power to detect the effect of training session (F (3,33) = 9.254, power = 0.997) and should have enough power to detect the effect of group (estimated power of 0.835 for F(1,11)). The behavior performance of the miniscope-recorded mice was shown in the previous version for transparency, however no conclusion was drawn based on that data. To improve clarity, we now present data from the previous Supplemental Figure 5 as Figures 1F–H. This dataset clearly demonstrates that increased excitability of Ma2 cells did not affect motor learning. In addition, note that all quantification and conclusions drawn about neuronal activity are based on robust sample sizes: 1070 cells for controls and 403 for Chrna2-Cre+, or 70 assemblies for controls and 48 for Chrna2-Cre+. These sample sizes ensure sufficient statistical power, as demonstrated by the multiple significant effects and pairwise differences reported in our study. We reiterate that no underpowered tests were conducted in this study, and no conclusions were drawn on n = 3 controls and 3 Chrna2-Cre+ mice on behavioral outcomes.

    More comprehensive analyses and data presentation are also needed to substantiate the results. For example, examining calcium activity and behavioral performance on a trial-by-trial basis could clarify whether closely spaced reaching attempts influence baseline signals and skew interpretation.

    We agree and we performed a trial-by-trial analysis to verify the effect of adjacent prehensions in the trial signal. We found that only 17.7% of adjacent trials were affected by a previous trial. In addition we selected only trials not preceded by another trial for at least 6s, and evaluated whether activity immediately before the trial (-3 to -1s) is different from the activity long before the trial (-5 to -3s). The rationale is that if a trial would affect the baseline, then activity immediately before would be different from the activity long before the trial. In this analysis, we found no genotype- or session-related differences in baseline amplitude between epochs. Together these results confirm that prehension-related activity does not systematically alter non-prehension epochs. The results are shown in Supplemental Figure 3.

    The study uses cre-negative mice as controls for hM3Dq-mediated activation, which does not account for potential effects of Cre-dependent viral expression that occur only in Cre-positive mice. This important control would be necessary to substantiate the conclusion that it is increased Ma2 cell activity that drives the observed changes in behavior and cortical activity.

    Having a control group of Cre+ mice injected with cre-dependent vector control carrying, for example, only fluorescence, would add one more layer of certainty that the effects observed here are due to CLZ-induced hM3Dq activation. We do not agree, however, that it is necessary to confirm our findings. Cre-dependent expression alone was already extensively demonstrated to have no effect by comparing a DREADD activator to a vehicle treatment (for example 10.7554/eLife.38052, 10.1523/JNEUROSCI.0537-18.2018, 10.7554/eLife.67822). We also showed this for our LFP group (Figure 4), further confirming no effect of Cre-dependent hM3Dq expression alone.

    An unspecific effect of clozapine, where the treatment affects animals without the hM3Dq receptor, would be much more likely. We do control for this by giving the same treatment to Cre+ and Cre- mice. Moreover, since we use a low dose of clozapine, a lack of hM3Dq activation would be more likely, which we also controlled for with the c-Fos experiment as explained in the answer to the Minor point 1. Nevertheless, we added to the discussion that although we find it highly unlikely that the effects found here are due to Cre-dependent viral expression, we have not recorded Cre+ animals expressing control vectors instead of hM3Dq (lines 1360-1375).

    Reviewer #2 (Recommendations for the authors):

    Major points

    (1) One of the main findings in this paper is that Chrna2-Cre cell activation did not affect learning of the prehension task; however, the presented data do not convincingly support this claim. Looking at Fig.1F, Cre+ mice appear to have an overall lower number of successful prehensions compared to control mice. If this is not statistically significant, it is likely because n=3 mice for each group is underpowered. To better judge the behavior of these mice, it would be necessary to plot success rate and overall number of prehensions over the entire course of training, in addition to successes per minute. Given that n=3, plotting all individual data points would make more sense than showing a violin plot. Relatedly, in Supplemental Figure 5, there appears to be a clear effect on reduced success rates in Cre+ mice, which is stated in the figure legends, whereas the result section states: we found no effect of genotype on prehension success rates (lines 895-896). The authors should ensure that these behavior experiments are sufficiently powered to detect potential differences in learning between groups and present the complete data and statistical analysis.

    As explained on Comment 1, the finding that Ma2 increased excitability did not alter motor learning is not based on the data on the previous Figure 1F (n=3 Cre+ and n=3 controls, shown for transparency). Instead, it is supported by the data in the previous Supplemental Figure 5, now Figures 1F-H, with n=6 Cre+ and n=7 controls, for which we found only overall effects of training session, but no effect of genotype, with no significant post-hoc pairwise comparisons. We agree that plotting the success rate, total number of prehensions and successful prehensions per minute, for all 6 sessions, allows better evaluation of the mice behavior. We moved the Supplemental Figure 5 into Figure 1, plotting the three measures for the full set of sessions, with individual data points within the violin plots, and expanded the statistical results description on the main text. We reiterate that no underpowered tests were conducted in this study, and no conclusions were drawn on n = 3 controls and 3 Chrna2-Cre+ mice.

    (2) The authors mention that a significant fraction of prehension trials overlapped with a preceding prehension attempt. Were those attempts excluded from the analysis? The stark differences in calcium signals at baseline before prehension onset in some sessions (Figure 1G, Supplementary Figure 2D) suggest that trials preceding closely in time might play a role and could skew the analysis and interpretation.

    Overlapping trials were not excluded from the previous analysis. As summarized in our response to Comment 2, and expanded in the results section (lines 876-894), we found that only 17.7% of adjacent trials were affected by a previous trial, and that when selecting only trials not preceded by another trial for at least 6s, we found no effect of prehension-related activity in the baseline preceding the trials.

    (3) Relatedly, to test the differences in calcium activity before and after prehension onset, it would be clearer to use a delta F/F measure where the 1 second before onset is used as baseline.

    Since a large proportion of neurons are more active before the onset (on the movement planning phase, Figure 2C), the activity 1s before the movement onset cannot be considered as F0. Dividing the activity during the movement by the activity during the planning phase would generate a different measure, a form of execution/planning ratio. We performed this analysis as an additional measure and found a three-way interaction effect of genotype, session, and prehension accuracy, driven by genotype effects on early sessions, indicating that Ma2 activity might be involved in the planning/execution activity balance. Those results are now described in the results section and shown at the Supplemental Figure 4.

    (4) For the experiments in which mice were trained prior to Ma2 cell activation (Fig.4), the behavior in sessions 8-10 does not seem to have reached a plateau yet, and the increase in successful prehensions in sessions 11-13 of Cre+ mice could just be a continuation of training. It would be more convincing to show the original training curve of those mice in sessions 1-7. Additionally, the authors should perform a two-way ANOVA test for the interaction of drug and genotype, rather than two separate one-way ANOVAs.

    We agree, and we now show the curve for sessions 1-7 in Supplemental Figure 7, showing that the success ratio for sessions 8-10 is similar to session 7. Also, a 2-way ANOVA was already performed, although the full report was missing from the manuscript. We switched from successful prehensions per minute to success ratio (see Reviewer #1 comment 1a) and now include the full report, in which we found an overall effect of session, and when grouping by genotype, we found an effect for Cre+ but not control mice (lines 1065-1072).

    Minor points

    (1) The validation experiment for the efficacy of hM3Dq is somewhat confusing. It is surprising that the few hM3Dq-mCherry expressing cells in the cre-negative mice did not show increased c-Fos staining since non-specific leaky hM3Dq expression would presumably still lead to a functional DREADD. The better control for validating the efficacy of hM3Dq-mediated Chrna2-Cre cell activation would be to show c-Fos staining in Cre+ mice with or without clozapine injection. This would control for non-specific c-Fos expression and neuronal activation purely by expression of the DREADD. In cre-negative control mice, the comparison should also be between mice with and without clozapine injection to control for non-specific neuronal activation regardless of hM3Dq expression.

    We thank the reviewer for raising this point and agree that validation of hM3Dq efficacy and specificity requires careful interpretation. In principle, any hM3Dq-expressing cell, including the few hM3Dq-mCherry+ cells observed in Cre– mice, could respond to clozapine. However, in practice, effective DREADD activation depends on sufficient receptor expression levels and on the pharmacodynamics of clozapine in the brain (Gomez et al., 2017, Science, 10.1126/science.aan2475). In our dataset, even in Chrna2-Cre+ mice, only ~76% of hM3Dq+ cells showed c-Fos induction after clozapine, indicating that receptor expression and/or ligand access is not uniform across cells. Consistent with this, the very sparse and weak hM3Dq expression observed in Cre- mice resulted in only 0.8% of hM3Dq+ cells showing c-Fos induction, which is in line with previous reports demonstrating that low-level “leaky” expression is insufficient to drive neuronal activation (e.g. 10.1038/s41467-019-12236-z; 10.1523/JNEUROSCI.0537-18.2018; 10.1523/ENEURO.0363-21.2021).

    The reviewer also suggests that an ideal validation would compare Cre+ mice with and without clozapine to control for any c-Fos induction driven purely by DREADD expression. We agree that such a comparison is informative, and note that in our experiments the c-Fos assay was designed specifically to test whether the low clozapine dose used (0.01 mg/kg) is sufficient to activate hM3Dq in Ma2 cells, rather than to assay baseline effects of viral expression.

    Importantly, non-specific effects of clozapine itself were controlled for throughout the study by administering the same clozapine dose to both Chrna2-Cre+ and Cre– mice in all behavioral and physiological experiments. Thus, any clozapine-driven neuronal activation independent of hM3Dq would be expected to appear in both groups.

    Together, these results indicate that (i) the clozapine dose used is sufficient to robustly activate hM3Dq-expressing Ma2 cells, (ii) sparse leaky expression in Cre– mice is not sufficient to drive measurable activation, and (iii) the effects reported in the manuscript are unlikely to be explained by non-specific clozapine actions or by viral expression alone.

    (2) The authors state in the methods section that "only neurons that displayed a significant change comparing the before onset and after onset phases" were included in the analysis. This appears to bias the data towards neurons that change their activity with the prehension movement. If this is the intention, the authors should clearly state this and their rationale in the results section and show what proportion of recorded neurons fall into this category.

    Yes, thanks for pointing this out, the explanation for this exclusion criteria is missing. We expanded the methods section “Neural activity around prehensions” to explain that since we are evaluating the role of Ma2 cells in the prehension-related activity of pyramidal cells, we excluded neurons with no prehension-related activity. We also stated in the expanded text that 15.97% of recorded neurons were excluded due to no prehension-related activity.

    (3) I don’t understand the peak PC activity latency shown in Figure 2D. How is it possible that there are negative peak latencies during the prehension phase, which is defined as >0sec, (upper right panel), and positive peak latencies in the before prehension phase, which is defined as <0sec, (lower right panel)?

    As stated in lines 939-941 and in the figure 2C legend, neurons were sorted into "before prehension" or "during prehension" neurons according to their activity during the successful prehension. One of our main findings is that the pyramidal cells temporal patterns were strongly affected by prehension accuracy (lines 941-944) meaning that a significant number of neurons shifted prehension phases when performing a failed prehension (as illustrated in Figure 2C, note how the temporal pattern is not kept from successful to failed prehensions). That is why, for failed prehensions, there are negative latencies for neurons that were classified as "during prehension" and positive latencies for neurons classified as "before prehension" in successful trials. We expanded the sorting explanation in the results section (lines 944-950) to better highlight the latency change between different prehension accuracies.

    (4) Please specify how baseline subtraction (detrending) was performed for the calcium image analysis.

    We expanded the methods section “Neural signal extraction” to better explain that we will now refer to F0 as the background component (and refer to our resulting traces as dF/F) and explain how it was determined (lines 614-619).

    (5) The authors state that they found a "dissociation between changes in neural activity and performance outcomes". Since they only analyzed motor performance by quantifying successful prehensions, this statement should be caveated with the notion that other aspects of the behavior (e.g., trajectories/speed) could be affected but were not measured.

    We agree, and expanded the discussion section to acknowledge that we focussed the behavioral aspects to success ratio, and that other measures not investigated could also be affected (lines ????-????).

    (6) Are the differences in theta and gamma power specific to the prehension trials, or does Ma2 cell activation generally increase LFP activity in those bands?

    We thank the reviewer for the question, as we had not analyzed general LFP activity in the previous version. We performed the same analysis now including only LFP from epochs outside prehension windows across the full sessions. We found that Mα2 cell activation actually reduces LFP power across all bands specifically in Session 13 when no prehension is being performed. These findings are now included as Supplemental Figure 7.

    (7) Please define terms that might not be familiar to a typical reader in the field, such as "assemblies", when first introducing them in the text.

    We revised the introduction where we now define assemblies (lines 85-88).

    (8) Please specify the n-numbers for each figure throughout the manuscript. For example, in some figures, the number of trials or the number of neurons is used; however, it is not clear what this number is.

    We agree that although the n-numbers are stated in the text, it would be clearer to add them also to the figure legends. All figure legends now contain n-numbers for panels showing quantifications.

    (9) Relatedly, while the inclusion of supplemental tables with expanded statistical results is commendable, several statistical test details are missing, such as for Figure 5.

    We have fully revised the text to add any missing statistical details for the statements in the Supplemental Tables.

  5. eLife Assessment

    This valuable study addresses a critical and timely question regarding the role of a subpopulation of cortical interneurons (Chrna2-expressing Martinotti cells) in motor learning and cortical dynamics. However, while some of the behavior and imaging data are impressive, the small sample sizes and incomplete behavioral and activity analyses make interpretation difficult; therefore, they are insufficient to support the central conclusions. The study may be of interest to neuroscientists studying cortical neural circuits, motor learning, and motor control.

  6. Reviewer #1 (Public review):

    In this study, the authors investigated a specific subtype of SST-INs (layer 5 Chrna2-expressing Martinotti cells) and examined its functional role in motor learning. Using endoscopic calcium imaging combined with chemogenetics, they showed that activation of Chrna2 cells reduces the plasticity of pyramidal neuron (PyrN) assemblies but does not affect the animals' performance. However, activating Chrna2 cells during re-training improved performance. The authors claim that activating Chrna2 cells likely reduces PyrN assembly plasticity during learning and possibly facilitates the expression of already acquired motor skills.

    There are many major issues with the study. The findings across experiments are inconsistent, and it is unclear how the authors performed their analyses or why specific time points and comparisons were chosen. The study requires major re-analysis and additional experiments to substantiate its conclusions.

    Major Points:

    (1a) Behavior task - the pellet-reaching task is a well-established paradigm in the motor learning field. Why did the authors choose to quantify performance using "success pellets per minute" instead of the more conventional "success rate" (see PMID 19946267, 31901303, 34437845, 24805237)? It is also confusing that the authors describe sessions 1-5 as being performed on a spoon, while from session 6 onward, the pellets are presented on a plate. However, in lines 710-713, the authors define session 1 as "naïve," session 2 as "learning," session 5 as "training," and "retraining" as a condition in which a more challenging pellet presentation was introduced. Does "naïve session 1" refer to the first spoon session or to session 6 (when the food is presented on a plate)? The same ambiguity applies to "learning session 2," "training session 5," and so on. Furthermore, what criteria did the authors use to designate specific sessions as "learning" versus "training"? Are these definitions based on behavioral performance thresholds or some biological mechanisms? Clarifying these distinctions is essential for interpreting the behavioral results.

    (1b) Judging from Figures 1F and 4B, even in WT mice, it is not convincing that the animals have actually learned the task. In all figures, the mice generally achieve ~10-20 pellets per minute across sessions. The only sessions showing slightly higher performance are session 5 in Figure 1F ("train") and sessions 12 and 13 in Figure 4B ("CLZ"). In the classical pellet-reaching task, animals are typically trained for 10-12 sessions (approximately 60 trials per session, one session per day), and a clear performance improvement is observed over time. The authors should therefore present performance data for each individual session to determine whether there is any consistent improvement across days. As currently shown, performance appears largely unchanged across sessions, raising doubts about whether motor learning actually occurred.

    (1c) The authors also appear to neglect existing literature on the role of SST-INs in motor learning and local circuit plasticity (e.g., PMID 26098758, 36099920). Although the current study focuses on a specific subpopulation of SST-INs, the results reported here are entirely opposite to those of previous studies. The authors should, at a minimum, acknowledge these discrepancies and discuss potential reasons for the differing outcomes in the Discussion section.

    (2a) Calcium imaging - The methodology for quantifying fluorescence changes is confusing and insufficiently described. The use of absolute ΔF values ("detrended by baseline subtraction," lines 565-567) for analyses that compare activity across cells and animals (e.g., Figure 1H) is highly unconventional and problematic. Calcium imaging is typically reported as ΔF/F₀ or z-scores to account for large variations in baseline fluorescence (F₀) due to differences in GCaMP expression, cell size, and imaging quality. Absolute ΔF values are uninterpretable without reference to baseline intensity - for example, a ΔF of 5 corresponds to a 100% change in a dim cell (F₀ = 5) but only a 1% change in a bright cell (F₀ = 500). This issue could confound all subsequent population-level analyses (e.g., mean or median activity) and across-group comparisons. Moreover, while some figures indicate that normalization was performed, the Methods section lacks any detailed description of how this normalization was implemented. The critical parameters used to define the baseline are also omitted. The authors should reprocess the imaging data using a standardized ΔF/F₀ or z-score approach, explicitly define the baseline calculation procedure, and revise all related figures and statistical analyses accordingly.

    (2b) Figure 1G - It is unclear why neural activity during successful trials is already lower one second before movement onset. Full traces with longer duration before and after movement onset should also be shown. Additionally, only data from "session 2 (learning)" and a single neuron are presented. The authors should present data across all sessions and multiple neurons to determine whether this observation is consistent and whether it depends on the stage of learning.

    (2c) Figure 1H - The authors report that chemogenetic activation of Chrna2 cells induces differential changes in PyrN activity between successful and failed trials. However, one would expect that activating all Chrna2 cells would strongly suppress PyrN activity rather than amplifying the activity differences between trials. The authors should clarify the mechanism by which Chrna2 cell activation could exaggerate the divergence in PyrN responses between successful and failed trials. Perhaps, performing calcium imaging of Chrna2 cells themselves during successful versus failed trials would provide insight into their endogenous activity patterns and help interpret how their activation influences PyrN activity during successful and failed trials.

    (2d) Figure 1H - Also, in general, the Cre⁺ (red) data points appear consistently higher in activity than the Cre⁻ (black) points. This is counterintuitive, as activating Chrna2 cells should enhance inhibition and thereby reduce PyrN activity. The authors should clarify how Cre⁺ animals exhibit higher overall PyrN activity under a manipulation expected to suppress it. This discrepancy raises concerns about the interpretation of the chemogenetic activation effects and the underlying circuit logic.

    (3) The statistical comparisons throughout the manuscript are confusing. In many cases, the authors appear to perform multiple comparisons only among the N, L, T, and R conditions within the WT group. However, the central goal of this study should be to assess differences between the WT and hM3D groups. In fact, it is unclear why the authors only provide p-values for some comparisons but not for the majority of the groups.

    (4a) Figure 4 - It is hard to understand why the authors introduce LFP experiments here, and the results are difficult to interpret in isolation. The authors should consider combining LFP recordings with calcium imaging (as in Figure 1) or, alternatively, repeating calcium imaging throughout the entire re-training period. This would provide a clearer link between circuit activity and behavior and strengthen the conclusions regarding Chrna2 cell function during re-training.

    (4b) It is unclear why CLZ has no apparent effect in session 11, yet induces a large performance increase in sessions 12 and 13. Even then, the performance in sessions 12 and 13 (~30 successful pellets) is roughly comparable to Session 5 in Figure 1F. Given this, it is questionable whether the authors can conclude that Chrna2 cell activation truly facilitates previously acquired motor skills?

    (5) Figure 5 - The authors report decreased performance in the pasta-handling task (presumably representing a newly learned skill) but observe no difference in the pellet-reaching task (presumably an already acquired skill). This appears to contradict the authors' main claim that Chrna2 cell activation facilitates previously acquired motor skills.

    (6) Supplementary Figure 1 - The c-fos staining appears unusually clean. Previous studies have shown that even in home-cage mice, there are substantial numbers of c-fos⁺ cells in M1 under basal conditions (PMID 31901303, 31901303). Additionally, the authors should present Chrna2 cell labeling and c-fos staining in separate channels. As currently shown, it is difficult to determine whether the c-fos⁺ cells are truly Chrna2 cells⁺.

    Overall, the authors selectively report statistical comparisons only for findings that support their claims, while most other potentially informative comparisons are omitted. Complete and transparent reporting is necessary for proper interpretation of the data.

  7. Reviewer #2 (Public review):

    Summary:

    In this manuscript, Malfatti et al. study the role of Chrna2 Martinotti cells (Mα2 cells), a subset of SST interneurons, for motor learning and motor cortex activity. The authors trained mice on a forelimb prehension task while recording neuronal activity of pyramidal cells using calcium imaging with a head-mounted miniscope. While chemogenetically increasing Mα2 cell activity did not affect motor learning, it changed pyramidal cell activity such that activity peaks became sharper and differently timed than in control mice. Moreover, co-active neuronal assemblies become more stable with a smaller spatial distribution. Increasing Mα2 cell activity in previously trained mice did increase performance on the prehension task and led to increased theta and gamma band activity in the motor cortex. On the other hand, genetic ablation of Mα2 cells affected fine motor movements on a pasta handling task while not affecting the prehension task.

    Strengths:

    The proposed question of how Chrna2-expressing SST interneurons affect motor learning and motor cortex activity is important and timely. The study employs sophisticated approaches to record neuronal activity and manipulate the activity of a specific neuronal population in behaving mice over the course of motor learning. The authors analyze a variety of neuronal activity parameters, comparing different behavior trials, stages of learning, and the effects of Mα2 cell activation. The analysis of neuronal assembly activity and stability over the course of learning by tracking individual neurons throughout the imaging sessions is notable, since technically challenging, and yielded the interesting result that neuronal assemblies are more stable when activating Mα2 cells.

    Overall, the study provides compelling evidence that Mα2 cells regulate certain aspects of motor behaviors, likely by shaping circuit activity in the motor cortex.

    Weaknesses:

    The main limitation of the study lies in its small sample sizes and the absence of key control experiments, which substantially weaken the strength of the conclusions.

    Core findings of this paper, such as the lack of effect of Mα2 cell activation on motor learning, as well as the altered neuronal activity, rely ona sample size of n=3 mice per condition, which is likely underpowered to detect differences in behavior and contributes to the somewhat disconnected results on calcium activity, activity timing, and neuronal assembly activity.

    More comprehensive analyses and data presentation are also needed to substantiate the results. For example, examining calcium activity and behavioral performance on a trial-by-trial basis could clarify whether closely spaced reaching attempts influence baseline signals and skew interpretation.

    The study uses cre-negative mice as controls for hM3Dq-mediated activation, which does not account for potential effects of Cre-dependent viral expression that occur only in Cre-positive mice.

    This important control would be necessary to substantiate the conclusion that it is increased Mα2 cell activity that drives the observed changes in behavior and cortical activity.