In vivo mapping of striatal neurodegeneration in Huntington’s disease with Soma and Neurite Density Imaging

Curation statements for this article:
  • Curated by eLife

    eLife logo

    eLife Assessment

    This fundamental manuscript presents a novel application of the SANDI (Soma and Neurite Density Imaging) model to study microstructural alterations in the basal ganglia of individuals with Huntington's disease (HD). The compelling methods are, to our understanding, the first application of SANDI to neurodegenerative diseases, provide strong evidence for HD-related neurodegeneration in the striatum, account significantly for striatal atrophy, and correlate with motor impairments. The integration of novel diffusion acquisition and modelling methods with multimodal behavioural data are both of high value in their own right, and create a framework for future studies.

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Abstract

Background

Huntington’s Disease (HD) is an inherited neurodegenerative disorder characterised by progressive cognitive and motor decline resulting from atrophy within basal ganglia networks. Although no disease-modifying therapies currently exist, several novel clinical trials are ongoing. Sensitive non-invasive imaging biomarkers are therefore essential for evaluating therapeutic effects. Soma and Neurite Density Imaging (SANDI), a multi-shell diffusion-weighted imaging model, estimates intracellular signal fractions arising from sphere-shaped soma that show promise as proxies for HD-related neurodegeneration. Although HD is rare, it offers a valuable model for understanding other neurodegenerative diseases due to its clear genetic cause and shared patterns of protein abnormalities.

Objective

To characterise HD-related microstructural abnormalities in the basal ganglia and thalami using SANDI and examine associations between SANDI indices, volumetric measurements, and motor performance.

Methods

T1-weighted anatomical and multi-shell diffusion-weighted images (b-values: 200 s/mm²– 6,000 s/mm²) were acquired using a 3T Siemens Connectom scanner (300mT/m) in 56 HD individuals (MeanAge = 46.1, SDAge = 13.8, 25 females) and 57 healthy controls (MeanAge = 45.0, SDAge = 13.8, 31 females). HD participants completed Quantitative Motor (Q-Motor) tasks, including speeded and paced finger tapping, which were reduced to one principal component of motor performance. Following standard diffusion-weighted data preprocessing, SANDI and diffusion tensor models estimated apparent soma density, apparent soma size, apparent neurite density, extracellular signal fraction, fractional anisotropy, and mean diffusivity. The caudate, putamen, pallidum, and thalamus were segmented bilaterally, and micro-structural and volumetric indices were extracted and compared. Correlations between SANDI indices, Q-Motor performance, and volumetric measures were analysed.

Results

HD was associated with reduced apparent soma density (rrb = 0.32, p ≤ 0.007) and increased apparent soma size (rrb = 0.45, p < 0.001) and extracellular signal fraction (rrb = 0.34, p ≤ 0.003) in the basal ganglia, but not the thalami. These differences were more pronounced at HD-Integrated Staging System 0-1 than 2-3. No differences were found in apparent neurite density (rrb = 0.18, p = 0.17). HD-related increases in fractional anisotropy and mean diffusivity in the basal ganglia were replicated. Q-Motor component scores correlated negatively with apparent soma density and positively with apparent soma size and extracellular signal fraction. SANDI indices and age explained up to 63% of striatal atrophy in HD.

Conclusion

SANDI measures detected HD-related neurodegeneration in the striatum, accounted significantly for striatal atrophy, and correlated with motor impairments. Reduced apparent soma density and increased apparent soma size align with ex vivo evidence of medium spiny neuron loss and glial reactivity. SANDI shows promise as an in vivo biomarker and surrogate outcome measure for clinical trials of disease-modifying therapies for HD and other neurodegenerative diseases.

Article activity feed

  1. eLife Assessment

    This fundamental manuscript presents a novel application of the SANDI (Soma and Neurite Density Imaging) model to study microstructural alterations in the basal ganglia of individuals with Huntington's disease (HD). The compelling methods are, to our understanding, the first application of SANDI to neurodegenerative diseases, provide strong evidence for HD-related neurodegeneration in the striatum, account significantly for striatal atrophy, and correlate with motor impairments. The integration of novel diffusion acquisition and modelling methods with multimodal behavioural data are both of high value in their own right, and create a framework for future studies.

  2. Reviewer #1 (Public review):

    (1) In this study, the authors aimed at characterizing Huntington's Disease (HD) - related microstructural abnormalities in the basal ganglia and thalami as revealed using Soma and Neurite Density Imaging (SANDI) indices (apparent soma density, apparent soma size, extracellular water signal fraction, extracellular diffusivity, apparent neurite density, fractional anisotropy and mean diffusivity).

    (2) The study implements a novel biophysical diffusion model that extends up-to-date methodologies and presents a significant potential for quantifying neurodegenerative processes of the grey matter of the human brain in vivo. The authors comment on the usefulness of this technique in other pathologies, but they exemplify only with multiple sclerosis. Further development of this, building evidence should be provided.

    (3) Study found that HD-related neurodegeneration in the striatum accounted significantly for striatal atrophy and correlated with motor impairments. HD was associated with reduced soma density, increased apparent soma size and extracellular signal fraction in the basal ganglia, but not in the thalami. Additionally, these affects were larger at manifest stage.

    (4) The results of this work demonstrate the impact of HD on basal ganglia and thalami which can be further explored as a non-invasive biomarker of disease progression. Additionally, the study shows that SANDI can be used to explore grey matter microstructure in a variety of neurological conditions.

    Comments on revised version.

    I have no further comments. Thank you

  3. Reviewer #3 (Public review):

    Summary:

    Ioakeimidis and colleagues studied miscrostructural abnormalities in N=56 Huntington's disease (HD) patients compared to N=57 normative controls. The authors used a powerful MRI Connectom scanner and applied the SANDI model to estimate the soma size, neurite size, soma density, and extracellular fraction in key subcortical nuclei related to HD. In the striatum, they found decreased soma density and increased soma size, which also seemed to become more pronounced in advanced HD individuals in the final exploratory analyses. The authors conducted important analyses to find whether the SANDI measures correlate with clinical scores (i.e., QMotor) and whether the variance of the striatal volume is explained by the SANDI measures. They found a relationship of SANDI measures to both.

    Strengths:

    The study is both innovative and of high interest for the HD community. The authors provide a rich pool of statistical analyses and results which anticipate the questions that may emerge in the HD research community. Statistics are carefully chosen and image processing is done with state-of-the-art methods and tools. The sample size gives sufficient credibility to the findings. Altogether, I think this study sets a milestone in the attempts of the HD community to understand neuropathological processes with non-invasive methods, and extends the current knowledge of microstructural anomalies identified in HD with diffusion MRI. More importantly, the newly identified anomalies in soma size and soma density open new avenues for studying these biological effects further, and perhaps develop these biomarkers for use in clinical trials.

    Weaknesses:

    (1) An important question is whether the SANDI measures, which require an expensive scanner and elaborate processing, are better biomarkers than the more traditional DTI measures. Can the authors compare the effect size of FA/MD with SANDI measures. In some of the plots and tables, FA/MD seem to have comparable, if not higher, correlations with QMotor or CAP scores. On the same vein, it is unclear whether DTI measures were included in hierarchical stepwise regression. I wonder if the stepwise models may have picked up FA/MD instead of SANDI measures if they are given a chance. Overall, I hope the authors can discuss their findings also in this light of cost vs. benefit of adopting SANDI in future studies, which is an important topic for clinical trials.

    (2) Similar to the above point, it is very important to consider how strong the biomarking signal is from SANDI measures compared to the good old striatal volume. Some plots seem to indicate that volumes still have the highest correlation with QMotor, and highest effect size in group comparisons. It would be helpful for the community to know where do the new SANDI measures stand compared to the most typically used volumes in terms of effect size.

    (3) The diffusion measures are inevitably correlated to some degree. Please provide a correlation matrix in supplementary material including all DWI measures to enable readers to understand better how similar SANDI measures are between each other or vs. other DTI measures. Perhaps adding volumes to this correlation matrix may also be a good future reference.

    (4) ISS stages:

    (a) The online ISS calculator requires cut-offs derived from the longitudinal Freesurfer pipeline, while the authors do not have longitudinal data. Thus, the ISS classification might be inaccurate to some degree if the authors used the FS cross-sectional pipeline. Please review this issue and see if updated cut-offs should be used to classify participants.
    (b) Were there really no participants with ISS 0 among 56 HD individuals, please clarify in the manuscript?
    (c) A note on terminology that might be confusing to some readers. According to the creators of ISS, the ISS stages are created for research only, they are not used or applied in the clinic. On the other hand, the terms "premanifest" and "manifest" have a clinical meaning, typically based on the diagnostic confidence level. The assignment of ISS0-1 to premanifest and ISS2-3 to manifest may create some non-trivial confusion, if not opposition, in some segments the HD community. The authors can keep their current terminology but will need to at least clarify to the reader that this assignment is speculative, does not fully match the clinically-based categories, and should not be confused with similarly named groups in the previous literature.

    Comments on revised version.

    The authors have moved to address many points from reviewers. The manuscript had indeed become more objective, transparent, and to the point. The amount of information and analyses is large, which perhaps is inevitable when new methods are being tested for the first time in a neurodegenerative disease.

  4. Author response:

    The following is the authors’ response to the original reviews.

    Reviewer #1:

    (1) The biological and/or mathematical meaning of the Soma and Neurite Density Imaging (SANDI) indices (apparent soma density, apparent soma size, extracellular water signal fraction, extracellular diffusivity, apparent neurite density, fractional anisotropy, and mean diffusivity) should be briefly introduced for those less familiar with this novel technique.

    Further explanations about the biological and mathematical meaning of the SANDI indices were added to the introduction on page 6.

    (2) The study implements a novel biophysical diffusion model that extends up-to-date methodologies and presents a significant potential for quantifying neurodegenerative processes of the grey matter of the human brain in vivo. The authors comment on the usefulness of this technique in other pathologies, but they exemplify it only with multiple sclerosis. Further development of this, building evidence, should be provided.

    Clinical applications of SANDI have primarily focused on MS. However, since preparation of the manuscript, one study has been published reporting reductions in apparent soma density and white and grey matter specific differences in apparent soma size in amyotrophic lateral sclerosis (ALS) (Zeng et al., Eur J Radiol 2025, 10.1016/j.ejrad.2025.111981). These findings accord with the loss of motor neurons and glial responses in ALS. We have added this study to the introduction of SANDI on page 7.

    (3) Why are the basal ganglia compared against thalami? The rationale of this decision is missing.

    The thalami were selected as control regions based on the established trajectory of neurodegeneration in HD, which begins with early loss of medium spiny neurons in the striatum and later extends to surrounding structures, including the putamen and thalamus. Given that most participants in our study were at early disease stages, we assumed the thalami would remain relatively unaffected in this sample. This explanation has been added to the introduction on page 7.

    (4) The use of bullet points is unusual for a scientific paper format.

    Bullet points have been removed throughout the manuscript.

    (5) The authors mention that they eroded the boundaries of the subcortical masks. Providing the details and parameters of this erosion would be beneficial.

    Details of the default parameters of the FSL erode function that was used have been added to the method section on page 13.

    (6) In the conclusion, the authors state that their results will bridge the gap between histopathological findings and in vivo imaging, but it would be helpful if they could briefly explain how they imagine such a bridge (e.g., which kind of comparisons or correlations) and whether there exists any literature in this regard so far.

    We have added the following brief explanation to the conclusion on page 26: “Although conventional MRI lacks the resolution to directly capture histopathology, advanced biophysical models such as SANDI may help bridge this gap by providing biologically interpretable parameters that reflect tissue composition and capture histopathological changes in vivo.”

    (7) The scale is missing in Figure 3.

    The scale has been added to Figure 3.

    (8) In general, the work would benefit from a better organization and potentially a smaller number of figures and tables.

    The manuscript has been re-edited to improve the readability and organization throughout and the number of figures and tables were reduced by moving some of them to the Supplementary Material (old Tables 2 and 5 are now Supplementary Tables 2 and 3, old Figure 3 is now Supplementary Figure 1).

    Reviewer #2:

    Certain aspects of the study would benefit from clarification:

    (1) Scanner and acquisition consistency: While HD data are from the WAND study, it is not clear whether controls were scanned on the same scanner or protocol. Given the use of model-derived metrics (especially SANDI), differences in scanner or acquisition could introduce confounds. From the text, the HD participants are explicitly said to come from the WAND study (a longitudinal HD cohort). On the other hand, while the HC participants are described as age-matched controls, the paper does not clearly state whether they were scanned in the same study (i.e., WAND), on the same scanner, or with the same acquisition protocol. This ambiguity is potentially problematic, especially since they use model-derived diffusion metrics that can be very sensitive to scanner hardware, gradient strengths, and protocol settings. If the WAND HD data were acquired on a specific scanner (e.g., 3T Connectom) and the HCs were not, then differences in SANDI/DTI metrics might reflect scanner bias, not disease pathology. This is particularly critical in SANDI, which is sensitive to high b-values and SNR. It would strengthen the manuscript to explicitly state whether the HD and control data were acquired using the same scanner model, sequence, and protocol, and ideally at the same site. If this were not the case, the authors should include this as a limitation and discuss any harmonization strategies applied (e.g., ComBat, covariate modeling, etc).

    For harmonization and comparison purposes, HD and control data were acquired using the same strong gradient (300mT/m) 3T Connectom MRI system at CUBRIC with the same acquisition protocols and sequences. It should also be noted that the Connectom scanner has not had any software upgrades that could introduce scanner biases in data acquired at different time points. This is now made explicit on page 8 by stating that all MRI data for all participants were acquired on the same MRI system using the same acquisition protocols, and on page 10 by stating that all HD and HC MRI data included in our analyses were acquired on the same 3T Siemens Connectom scanner at CUBRIC using the same acquisition protocols described in this section.

    Also, although it offers novel and biologically informative markers, widespread clinical translation still faces hurdles. For instance, the study used a 3T Connectom scanner (300mT/m gradients), which is not widely available. Reproduction of these results in standard 3T clinical scanners would be a great addition, in scenarios with lower resolution, less precise parameter recovery, and longer scans if SNR needs to be maintained.

    We agree that for clinical adoption it is important to demonstrate that HD-related SANDI differences can also be detected on clinical MRI systems and do not require ultra-strong gradient imaging. While we have not collected such data in people with HD, we have demonstrated the feasibility of modelling SANDI metrics from multi-shell diffusion-weighted imaging acquired on a clinical 3T MRI (maximum b-value of 6,000 s/mm2) in healthy adults and people with MS (Schiavi et al 2023, https://doi.org/10.1002/hbm.26416). Furthermore, Zeng et al 2025, reported significant differences in SANDI metrics acquired on a 3T MRI Prisma system between individuals with ALS and healthy controls (maximum b-value of 3,000 s/mm2).

    Two additional studies demonstrated that SANDI could be implemented and microstructural differences could be detected in MS using 3T scanners with standard gradient strength (Barakovic et al., 2024; Margoni et al., 2023). Collectively, these findings indicate that SANDI can be applied on clinical scanners, particularly as clinical systems move toward stronger gradient capabilities such as Siemens Magnetom Cima.X. These explanations can be found under the clinical implication section in the Discussion on page 25.

    (2) Limitations of HD-ISS staging resolution and group separation:

    The use of HD-ISS staging to anchor progression analyses is conceptually appropriate, but, in practice, the sample is quite limited.

    (a) Only 26-27 out of 56 gene-positive participants could be assigned HD-ISS stages, and none were classified into stages 0 or 4. This restricts the interpretation of progression to a narrow clinical window (mostly stages 1-3) and excludes over 50% of the cohort.

    (b) Furthermore, visual inspection of the scatter plots (e.g., Figures 3 and 4) reveals substantial overlap between stages 1 and 2, particularly in CAP100 and Q-Motor measures. This suggests that the separation between early disease stages may not be robust in this dataset, potentially due to limited power or phenotypic variability.

    (c) The above may lead to claims based on progression across HD-ISS stages to be overinterpreted or underpowered

    Despite this, the paper treats the staging as a reliable stratification for group comparisons. To improve clarity and transparency, I would recommend that the authors:

    (a) Acknowledge that over 50% of the HD cohort could not be classified.

    (b) Discuss whether those excluded differed from those included in key metrics.

    (c) Explicitly comment on the substantial overlap between stages 1 and 2, and limit claims about progression unless such separation is statistically supported.

    (d) Avoid overinterpreting staging-related effects without statistical support for group separability

    Re a-d) We have added to the study limitations on pages 23 ff that only 54% (30 out of 56) HD participants could be HD-ISS classified due to missing data, and provide an overview of demographic and clinical information for HD-ISS stages and unclassified individuals in Supplementary Table 1. We acknowledge that the combined groups (HD-ISS 0-1 versus HD-ISS 23) for exploratory group analyses did not represent discrete disease stages and that there was some overlap in imaging and behavioural features between them as illustrated in Figures 3, 4, and 7. We state explicitly that these exploratory findings should be interpreted with caution and require replication in larger, prospective cohorts before SANDI metrics can be considered as potential markers of disease progression.

    (3) Clarify regression strategy and interpretational limits of SANDI-derived regressors: While the hierarchical regression strategy is broadly appropriate, several aspects would benefit from clarification to improve both interpretability and robustness of the findings. For example:

    (a) Why were only a subset of SANDI parameters (fis and De) considered in the HC models (Figure 6), while additional metrics (fec and rs) were tested in HD models (Figures 7-8)? Including the same variables across groups could aid comparability?

    The same SANDI indices were included in regression models for HD and HC groups, Figure 7-8 report only significant predictors. This has been clarified in the figure legend and on pages 14 of the manuscript.

    (b) Were any checks for multicollinearity (e.g., variance inflation factors) conducted? Given known interdependencies among some SANDI parameters, I wonder whether some of the reported regression coefficients may be unstable or difficult to interpret.

    Cross-correlation matrices between all imaging metrics for HD, HC, and total samples have been included to Supplementary materials Figure 3.

    To improve transparency and interpretability, I suggest actions such as:

    (a) SANDI metrics included in the models differ between HC and HD groups, reducing comparability. Consider using consistent full models across ROIs for comparison purposes, even if some predictors are not significant.

    (b) Report the correlation structure between SANDI metrics within each group to assess multicollinearity (The potential impact of multicollinearity (e.g., between fis and rs) is not discussed)

    (c) Explicitly acknowledge the limitations imposed by parameter degeneracy in the SANDI model and clarify how the authors ensured the biological interpretability of regression outputs in this context - Beta coefficients could reflect model instability or parameter degeneracy rather than true biological effects.

    (a) The same SANDI metrics and age were included in the first regression models for HD and HC data. The first models only differed by the inclusion of TFC as estimate of disease burden for the HD data. HD and HC participants were not included in a single regression model, as our aim was not to perform formal between-group inference on regression coefficients. Instead, models were fitted separately to explore within-group associations and to descriptively compare patterns of relationships across groups. This approach avoids imposing identical model structures across groups that may differ in variance structure, disease burden, and biological coupling between SANDI metrics. We have clarified these points on page 13/14.

    (b) We agree that multicollinearity is an important consideration when interpreting regression coefficients derived from microstructural models. To address this, we examined pairwise Spearman correlations between all imaging (SANDI, DTI, volume) metrics (averaged across ROIs), shown in the revised Supplementary Figure 2. As can be seen in the healthy control data, SANDI indices of apparent soma and neurite fractions showed a strong inverse correlation (rho = -0.92) and did not correlate with soma radius (rho = 0.1). All SANDI indices correlated only weakly with FA and volume and moderately with MD. This correlation pattern suggests that apparent soma density and radius capture distinct information about grey matter microstructure that differs from neurite fraction and is not captured by FA or volume. We note in HD participants a negative correlation between soma radius and fraction, and stronger correlations between SANDI metrics and volume measures. We would argue that these reflect disease-related reorganization of micro- and macro-structural relationships rather than uniform collinearity across groups. This information has been added to the Methods, Results and Discussion sections on pages 13, 19, and 21, 23ff.

    (c) We agree that regression coefficients derived from interdependent microstructural parameters should be interpreted with caution, as they may reflect shared variance or partial parameter degeneracy rather than fully independent biological effects. For this reason, we do not interpret individual beta coefficients in isolation. Instead, our conclusions focus on the consistency and directionality of associations across regions and metrics, and on the overall feasibility and sensitivity of SANDI to detect biologically meaningful variation in HD. The observed correlation structure (Supplementary Figure 2) provides important context for these interpretations and supports a multivariate, pattern-based rather than univariate reading of the results. These points have been added to the Discussion on pages 23 ff. Please also refer to our response to point (5) below.

    (4) Preprocessing order:

    Gibbs ringing correction was applied after TOPUP and EDDY, which deviates from the commonly recommended order in diffusion MRI preprocessing. Since Gibbs artifacts are introduced by kspace truncation and affect the spatial domain, it is typically advised to perform Gibbs correction prior to geometric corrections like TOPUP and EDDY. This avoids potential blurring or propagation of ringing artifacts during resampling. Could the authors clarify the rationale for this ordering, and whether an early application of Gibbs correction was tested?

    We agree that the application of Gibbs ringing correction prior to TOPUP and EDDY correction deviates from the commonly recommended order in diffusion MRI preprocessing. However, as some of the data included in this paper were preprocessed before this consensus was agreed in the literature, we kept the preprocessing order consistent for all datasets for harmonization and comparison purposes. We have since changed the order for subsequent preprocessing of the HDDRUM data and have found comparable FA maps for data processed with Gibbs ringing correction before and after TOPUP and EDDY correction.

    (5) Expand on SANDI model assumptions:

    SANDI is presented as being used for the very first time in this problem. However, a vague explanation is given: "using all the default settings". Given the novelty of applying SANDI in a clinical HD context, the manuscript would benefit from a discussion of the model's key assumptions and limitations. For instance:

    (a) The potential degeneracy between fis and rs in the absence of protocol features (e.g., long Δ or high b) that can disambiguate them.

    (b) Whether a dot compartment was included, and the implications of excluding it for the interpretation of rs or fis.

    (c) The lack of exchange modeling or fixed stick diffusivity, and how these may bias compartment estimates (particularly in diseased or aging tissue).

    (d) Any steps taken to verify robustness or identifiability (e.g., simulations, synthetic fitting). These issues are not flaws in the method, but they do affect how confident we can be in interpreting fis/rs as markers of neuron loss or glial hypertrophy, especially given the subtle group differences and the potential for biological heterogeneity in HD. Even a brief acknowledgment would strengthen the manuscript and provide useful context to readers less familiar with multicompartment modeling.

    We thank the reviewer for this constructive suggestion and fully agree that, because this is the first application of SANDI in our clinical HD cohort, the manuscript should more explicitly describe the model assumptions, potential identifiability limitations under our protocol, and the implications for biological interpretation.

    We have revised the Methods (pages 11-12) and Discussion (page 24) to (i) specify the exact SANDI implementation used (the SANDI MATLAB toolbox, available at: https://github.com/palombom/SANDI-Matlab-Toolbox-Latest-Release), (ii) describe which components are included in the default formulation and the key modelling assumptions, and (iii) add a dedicated “Limitations and interpretability” paragraph addressing points (a–d) below. We also avoid the previous shorthand “default settings” and provide a clear description of the fitting setup.

    “The SANDI model [Palombo M. et al, NeuroImage 2020] assumes three compartments, namely intra-neurite signal modelled as diffusion inside impermeable randomly oriented sticks, intra-soma signal modelled as restricted diffusion inside spheres, and extra-cellular signal modelled as Gaussian isotropic diffusion. The direction-averaged (or spherical mean) normalized diffusion signal has thus the following expression:

    S(b) = fisAsphere (b, rs, Dis) + finAstick (b, Din) + fecA ball (b, De)

    where fin + fis+ fec = 1; Astick and Asphere are the normalized, directionally-averaged (or spherical mean) signals for restricted diffusion within neurites and soma, respectively and Aball is the normalized, directionally-averaged (or spherical mean) signal of the extra-cellular space. The specific expressions are given in [Palombo M. et al. NeuroImage 2020]. The parameters estimated from the direction-averaged (or spherical mean) data are Din, proxy of the intra-neurite effective axial diffusivity; De, proxy of the extracellular effective mean diffusivity; rs</sub, a proxy of apparent soma radius as well as the signal fractions subject to the constraint fin + fis + fec = 1, proxy respectively of the relaxation-weighted neurite, soma and extracellular volume fractions. The bulk diffusivity inside the sphere Dis is fixed to 3 μm2/ms. The parameters were fitted using a Random Forest regression algorithm (TreeBagger Matlab®) with 200 trees, trained on simulated data, using the code publicly available at https://github.com/palombom/SANDI-Matlab-Toolbox-Latest-Release. The training data consisted of simulated signals for 105 parameter combinations, uniformly sampled: fin and fis ∈ [0, 1], Din ∈ [0.5, 3] μm2/ms, De ∈ [0.5, 3] μm2/ms and rs ∈ [1, 12.5] μm. Rician noise with a distribution of standard deviations randomly sampled from the voxels within the brain mask of the noise map obtained using MPPCA denoising was added to account for realistic SNR levels and rectified noise floor. The loss function of the training was the mean squared error between predicted parameters and ground truth values. Model fitting provided maps of fin, fis, fe, Din, De and rs.”

    (a) Potential degeneracy between fisand rs. We agree that partial coupling (or degeneracy) between the soma fraction fis and soma radius rs is possible when the acquisition does not provide strong sensitivity to restricted sphere size (e.g., in the low b-values regime). Our protocol benefits from high b-values (up to 6000 s/mm2) enabled by the Connectom gradient system, which increases sensitivity to signal attenuation from restricted compartments and reduce the fis-rs coupling/degeneracy. However, we acknowledge that the specific choice of fixed diffusion timing (in our case δ=7 ms, Δ=24 ms) can further modulate the fis-rs coupling/degeneracy in a protocol-dependent way. To reflect this appropriately, we now explicitly state that rs should be interpreted as an “apparent soma radius” under our protocol, and that our inferences focus on relative group differences and spatial patterns rather than absolute histological soma radii.

    We have now added a paragraph in the limitations section acknowledging this point.

    (b) Dot compartment. We did not include an explicit “dot” (immobile) compartment, because there is no evidence that in human in vivo this is required (see for example very low and negligible contribution provided in Tax C. et al. NeuroImage 2020: https://www.sciencedirect.com/science/article/pii/S1053811920300215). Accordingly, our fits did not include a dot term, and we now state this explicitly in the Methods. However, we would like to clarify that our fitting method (described in details at https://github.com/palombom/SANDI-Matlab-Toolbox-Latest-Release) includes accurately the impact of Rician noise and thus it account for the corresponding rectified noise-floor that very often, in high b-values applications, is mistakenly associated with a “dot” compartment. Therefore, there is no expected bias on the estimated fis and rs due to not including a “dot” compartment.

    (c) Exchange modelling and fixed stick diffusivity. We agree that SANDI, as implemented here, does not explicitly model inter-compartment exchange during the diffusion encoding and uses simplified representations of neurites (sticks), but the intra-stick diffusivity, Din, was not fixed but rather fitted. In diseased or aging tissue, deviations from these assumptions (e.g., altered membrane permeability) may bias compartment estimates. This has been investigated in dept in Schiavi S. et al. HBM 2023 (https://onlinelibrary.wiley.com/doi/full/10.1002/hbm.26416), so we refer the redear to that. We have added an explicit limitation statement noting that HD-related microstructural changes (e.g., changes to membrane permeability) could affect model parameter fidelity, and thus fisand rs should be treated as MRI-derived effective indices rather than direct quantitative measures of neuron loss or glial hypertrophy. Importantly, our analysis compares groups under an identical acquisition and fitting pipeline, so grouplevel contrasts remain informative even if absolute parameter values are biased.

    (d) Robustness / identifiability checks. We agree that reporting robustness strengthens confidence, particularly given subtle effects and biological heterogeneity. The SANDI Matlab Toolbox we used extensively investigates model parameters robustness and identifiability using numerical simulations and synthetic signals accounting for the specific experimental protocol and noise distribution. An example of the results supporting the robustness / identifiability is reported in the Author response images. These results show that accuracy and precision of all SANDI model parameters, except Din, is very high (>~80%, Author response image 1)

    Author response image 1.

    Analysis of the accuracy and precision of SANDI model parameters estimation. We simulated 104 synthetic diffusion signals using the SANDI model with random combinations of five parameters: fneurite(fin), fsoma(fis), Din, Rsoma(rs), and De. Parameters were sampled uniformly from: fneurite, fsoma ∈ [0,1]; Din, De 𝛜[0.5,3.0] µm2/𝑚𝑠; 𝑅soma 𝛜[1,12] µm. Rician noise with experimentally estimated variance was added, and the SANDI model was then fit to the noisy signals. For each parameter, we report the relative percentage error between estimated and ground-truth values as a function of the parameter value (normalized to [0,1]), together with goodness-of-fit (R2).

    and sensitivity to changes as small as 5% in each of the model parameters is correctly captured (Author response image 2A), with small to negligible degeneracy (except, once again, for Din), even in presence of exchange (Author response image 2B).

    Author response image 2.

    Sensitivity to 5% parameter modulations. The matrices show how a controlled perturbation in one parameter propagates into the estimated values of all model parameters. Each row corresponds to a 5% increase in the parameter on the y-axis; the resulting percentage change observed in each estimated parameter is reported along the x-axis. An ideal estimator would yield a purely diagonal matrix, with 5% on the diagonal and 0% elsewhere (no cross-talk). In (A), we used the same synthetic SANDI signals as in Figure 1. In (B), we additionally generated 104 synthetic signals incorporating neurite–extra-cellular exchange using the NEXI model [https://doi.org/10.1016/j.neuroimage.2022.119277] and an exchange time representative of human cortex (𝜏ex ≈ 30 ms) [https://doi.org/10.1162/imag_a_00104].

    We have therefore revised the manuscript language to be more precise and appropriately cautious, describing fis and rs as apparent compartment indices and explicitly discussing potential confounds (e.g., parameter coupling, and unmodelled exchange), while clarifying the value of SANDI for detecting reproducible group-level microstructural differences in HD.

    (6) Clarify "not-classified" group in figures:

    It is not clear to me what the "not-classified" groups shown in Figures 3-4 represent, what criteria determined their inclusion, and whether their inclusion affects the comparability or interpretability of staging-based analyses

    We have added to the legends of Figures 3 and 4 that not-classified refers to HD participants who could not be HD-ISS classified due to missing clinical data or their CAG repeat falling within the 36-40 range. As correlation analyses were conducted across the whole HD sample though, these datapoints were included in the scatterplot.

    (7) Figure labeling:

    There appears to be a mismatch between figure numbering and captions around Figures 3-4. Please ensure alignment.

    Mismatch between figure numbering and captions has been corrected.

    Minor suggestions:

    (1) Figures 1-2:

    (a) Label axis values meaningfully, e.g., negative vs. positive instead of 0 vs 1.

    (b) Add units to MD axes (e.g., ×10⁻⁴ mm²/s).

    (c) Figure 6 colors: Consider improving the color distinction between "Age" and "fis" predictors, which are currently hard to differentiate.

    The suggested adjustments have been made to Figures 1, 2, 5 and 6 and Figure 2 legend.

    (c) Discuss why apparent soma size decreases in some ROIs (e.g., pallidum), if unexpected.

    We offer the following speculation about the reduced soma size in the pallidum (pages 20/21): Changes in apparent soma size may reflect alterations in neural and glial cell proportions and/or morphology, including astrocyte and microglia swelling in response to neurodegeneration and soma shrinkage preceding neuronal cell death. Thus, increased apparent soma size in the striatum may indicate HD-related reorganisation of cell types driven by MSN loss and reactive glial cell swelling, whereas smaller soma size in the pallidum may result from infiltration of smaller glia cells prior to secondary neuronal loss following striatal MSN degeneration.

    Reviewer #3:

    (1) An important question is whether the SANDI measures, which require an expensive scanner and elaborate processing, are better biomarkers than the more traditional DTI measures. Can the authors compare the effect size of FA/MD with SANDI measures? In some of the plots and tables, FA/MD seem to have comparable, if not higher, correlations with QMotor or CAP scores. On the same vein, it is unclear whether DTI measures were included in hierarchical stepwise regression. I wonder if the stepwise models may have picked up FA/MD instead of SANDI measures if they are given a chance. Overall, I hope the authors can discuss their findings also in this light of cost vs. benefit of adopting SANDI in future studies, which is an important topic for clinical trials.

    Effect sizes (ES) of group differences in all microstructural indices can be found in Table 4. ES of DTI and SANDI indices in the caudate and putamen were broadly comparable with a trend for MD showing larger ES (FA: rrb = 0.38 -0.55, MD: rrb = 0.51 -0.61, fis: rrb = 0.32 -0.45, rs: rrb = 0.45 0.53).

    This information is now reported in the result section on pages 15/16 and is being discussed in light of cost versus benefit considerations on pages 21 and 25.

    (2) Similar to the above point, it is very important to consider how strong the biomarking signal is from SANDI measures compared to the good old striatal volume. Some plots seem to indicate that volumes still have the highest correlation with QMotor and the highest effect size in group comparisons. It would be helpful for the community to know where the new SANDI measures stand compared to the most typically used volumes in terms of effect size.

    Effect sizes (ES) of group differences in volumes can be found in Table 2. ES in caudate and putamen volumes ranged between rrb = 0.49 -0.55 and were comparable to the ES of apparent soma size rrb = 0.45 -0.53 but slightly larger than ES of soma density rrb = 0.32 -0.45.

    This information is now reported in the result section on page 15/16 and is being discussed on pages 21 and 25.

    (3) The diffusion measures are inevitably correlated to some degree. Please provide a correlation matrix in the supplementary material, including all DWI measures, to enable readers to better understand how similar SANDI measures are to each other or vs. other DTI measures. Perhaps adding volumes to this correlation matrix may also be a good future reference.

    We have added cross-correlation matrices between all imaging measures (SANDI, DTI, Volumes) for the total sample as well as for HC and HD participants separately to the Supplementary material (Figure 3), providing an overview of the shared variance within SANDI parameters and between SANDI and DTI and volume metrics for each group.

    (4) ISS stages:

    (a) The online ISS calculator requires cut-offs derived from the longitudinal Freesurfer pipeline, while the authors do not have longitudinal data. Thus, the ISS classification might be inaccurate to some degree if the authors used the FS cross-sectional pipeline. Please review this issue and see if updated cut-offs should be used to classify participants.

    We acknowledge that our HD-ISS classifications may have been biased due to the use of crosssectional rather than longitudinal FreeSurfer v6 volumes (page 23).

    (b) Were there really no participants with ISS 0 among the 56 HD individuals? Please clarify in the manuscript.

    We classified four individuals as ISS 0 based on their caudate and/or putamen z-scored volumes falling below 2SD of the healthy control mean. These analyses are described on pages 14-15 and were based on the cross-sectional data of this study.

    (5) A note on terminology that might be confusing to some readers. According to the creators of ISS, the ISS stages are created for research only; they are not used or applied in the clinic. On the other hand, the terms "premanifest" and "manifest" have a clinical meaning, typically based on the diagnostic confidence level. The assignment of ISS0-1 to premanifest and ISS2-3 to manifest may create some non-trivial confusion, if not opposition, in some segments of the HD community. The authors can keep their current terminology, but will need to at least clarify to the reader that this assignment is speculative, does not fully match the clinically-based categories, and should not be confused with similarly named groups in the previous literature.

    To avoid confusion about terminology, we have removed the labels “premanifest” versus “manifest” throughout the manuscript. We refer to HD-ISS 0-1 and HD-ISS 2-3 when referring to the exploratory comparisons between HD-ISS stages.

    (6) The population in the study seems to be obtained from different other studies or research projects, and there are missing scores for several participants due to the retrospective nature of sample gathering for the analyses. Please state clearly that this study was done with retrospective data to properly justify why there are missing data. Also, and this is important, please clarify for the reader whether there was any temporal bias in the acquisition of data of a certain group (HD) vs. another (HC). It is important to rule out that there were no scanner changes or upgrades that may confound the reported group differences.

    We can confirm there were no Connectom scanner changes or upgrades that may have confounded the reported group differences. This was added to the image acquisition section on page 10. We have added to the participant section on page 9 that data were retrospectively pooled from separate studies and explain this was the reason why HD-ISS classification was only available for a subset of participants.

    (7) Several of the significant results with SANDI scores seem to be driven by a subgroup of HD individuals that are more clearly different than the healthy control distribution. Not sure if this may help, but one idea the authors can consider is to check if HD individuals that deviate more than 2 SDs from the healthy control distribution of SANDI scores have also worse QMotor, worse atrophy, or higher CAP scores than those HD individuals that are practically within the 2SD boundary distribution of HDs. This is another way of showing that the new measures have potential for application in individualized medicine (the MRI Z score of a patient as a proxy of the clinical deterioration). It is not a request to authors but just a suggestion for their consideration.

    The data points in the scatterplots of Figures 3, 4, and 7 have now been color-coded according to HD-ISS stage, showing a stage-related worsening of microstructural and volumetric imaging markers and Q-Motor performance.

    (8) The variance explained in hierarchical regression is obtained by fitting models within the sample, and can be subject to overfitting. In the absence of a more robust cross-validated R2, the authors may want to at least briefly inform the reader that the current approach can be subject to overfitting and does not represent a true out-of-sample R2.

    We have added this point to the study limitations in the Discussion section on page 23.

    (9) There are two Figure 3 labels, and all figures thereafter do not match the manuscript.

    The Figure numbering has been corrected.

    (10) In (the currently labelled) Figure 8, there seem to be fewer than 56 data points in the scatterplots. Is there a reason why not all 56 HD individuals do not have the CAP100 score available? CAP needs only CAG and age, which all HD gene carriers should have, to be included in the study.

    Inclusion criteria for individuals with HD for the HD-DRUM project were a positive genetic test for the presence of the mutant huntingtin allele (CAG length ≥ 36 repeats) and/or a clinical diagnosis of HD. Thus, for a small number of participants CAG was not available for the calculation of CAP100 score.

  5. eLife Assessment

    This important manuscript presents a novel application of the SANDI (Soma and Neurite Density Imaging) model to study microstructural alterations in the basal ganglia of individuals with Huntington's disease (HD). The compelling methods, to our understanding, the first application of SANDI to neurodegenerative diseases, provide strong evidence for HD-related neurodegeneration in the striatum, account significantly for striatal atrophy, and correlate with motor impairments. The integration of novel diffusion acquisition and modelling methods with multimodal behavioural data are both of high value in their own right, and create a framework for future studies.

  6. Reviewer #1 (Public review):

    (1) In this study, the authors aimed at characterizing Huntington's Disease (HD) - related microstructural abnormalities in the basal ganglia and thalami as revealed using Soma and Neurite Density Imaging (SANDI) indices (apparent soma density, apparent soma size, extracellular water signal fraction, extracellular diffusivity, apparent neurite density, fractional anisotropy and mean diffusivity).

    (2) The study implements a novel biophysical diffusion model that extends up-to-date methodologies and presents a significant potential for quantifying neurodegenerative processes of the grey matter of the human brain in vivo. The authors comment on the usefulness of this technique in other pathologies, but they exemplify it only with multiple sclerosis. Further development of this, building evidence, should be provided.

    (3) The study found that HD-related neurodegeneration in the striatum accounted significantly for striatal atrophy and correlated with motor impairments. HD was associated with reduced soma density, increased apparent soma size, and extracellular signal fraction in the basal ganglia, but not in the thalami. Additionally, these effects were larger at the manifest stage.

    (4) The results of this work demonstrate the impact of HD on the basal ganglia and thalami, which can be further explored as a non-invasive biomarker of disease progression. Additionally, the study shows that SANDI can be used to explore grey matter microstructure in a variety of neurological conditions.

  7. Reviewer #2 (Public review):

    Summary:

    The authors aimed to investigate whether advanced microstructural diffusion MRI modeling using the SANDI framework could reveal clinically relevant tissue alterations in the subcortical structures of individuals with Huntington's disease (HD). Specifically, they sought to determine if SANDI-derived parameters-such as soma density, soma size, and extracellular diffusivity-could detect abnormalities in both manifest and premanifest HD stages, complement standard MRI biomarkers (e.g., volume, MD), and correlate with disease burden and motor impairment. Through this, they hoped to demonstrate the feasibility and added biological specificity of SANDI for early detection and characterization of HD pathology.

    Strengths:

    (1) Novelty and relevance:

    This is, to the best of my knowledge, the first clinical deployment of SANDI in HD, offering more biophysically interpretable and specific imaging biomarkers than standard DTI or volumetric features.

    (2) More specific microstructural insight: Traditional approaches have used volumetric features (e.g., striatal volume loss) or DTI metrics (like FA and MD), which are indirect and non-specific markers. They can indicate something is "wrong" but not what is wrong.

    (3) SANDI parameters permit establishing clearer links with microstructure:

    o Apparent soma density (fis): proxy for neuronal/glial cell body density.

    o Apparent soma size (rs): reflects possible gliagl hypertrophy or neuronal shrinkage.

    o Neurite density (fin): linked to dendritic/axonal integrity.

    o Extracellular fraction and diffusivity: sensitive to edema, gliosis, and tissue loss.

    In this way, a decrease in soma density can be related to neural loss (e.g., medium spiny neurons), and an increase in soma size and extracellular fraction could be related to glial reactivity (astrocytes, microglia). This enables differentiating between atrophy due to neuron loss vs reactive gliosis, which volumetrics or DTI cannot do.

    (4) Integration of modalities: The inclusion of motor impairment (Q-Motor), HD-ISS staging, and multi-compartment diffusion modeling is a methodological strength.

    (5) Early detection potential: SANDI metrics showed abnormalities in premanifest HD, sometimes even when volume loss was mild or absent. This suggests the potential for earlier, more sensitive biomarkers of disease progression.

    (6) Predictive power: Regression models showed that SANDI metrics explained up to 63% of the variance in striatal volumes in HD. And this correlated strongly with motor impairment and disease burden (CAP100). This shows they are not just redundant with volume or DTI, but they are complementary and potentially more mechanistically meaningful.

    Weaknesses:

    Certain aspects of the study would benefit from clarification:

    (1) Scanner and acquisition consistency: While HD data are from the WAND study, it is not clear whether controls were scanned on the same scanner or protocol. Given the use of model-derived metrics (especially SANDI), differences in scanner or acquisition could introduce confounds. Also, although it offers novel and biologically informative markers, widespread clinical translation still faces hurdles. For instance, the study used a 3T Connectom scanner (300mT/m gradients), which is not widely available. Reproduction of these results in standard 3T clinical scanners would be a great addition, in scenarios with lower resolution, less precise parameter recovery, and longer scans if SNR needs to be maintained.

    (2) HD-ISS staging and group comparisons:
    a) Only 26-27 out of 56 gene-positive participants could be assigned HD-ISS stages, and none were classified into stages 0 or 4.

    b) Visual overlap between stages 1 and 2 in behavioral and imaging features suggests that staging-based group separation may not be robust.

    c) The above may lead to claims based on progression across HD-ISS stages to be overinterpreted or underpowered

    (3) Regression modeling choices:
    a) SANDI metrics included in the models differ between HC and HD groups, reducing comparability.

    b) The potential impact of multicollinearity (e.g., between fis and rs) is not discussed.

    c) Beta coefficients could reflect model instability or parameter degeneracy rather than true biological effects.

    These issues do not undermine the study's main conclusions, which effectively demonstrate the feasibility and initial clinical relevance of applying SANDI to HD. Nonetheless, addressing them more thoroughly would enhance the clarity and interpretability of the manuscript.

  8. Reviewer #3 (Public review):

    Summary:

    Ioakeimidis and colleagues studied microstructural abnormalities in N=56 Huntington's disease (HD) patients compared to N=57 normative controls. The authors used a powerful MRI Connectom scanner and applied the SANDI model to estimate the soma size, neurite size, soma density, and extracellular fraction in key subcortical nuclei related to HD. In the striatum, they found decreased soma density and increased soma size, which also seemed to become more pronounced in advanced HD individuals in the final exploratory analyses. The authors conducted important analyses to find whether the SANDI measures correlate with clinical scores (i.e., QMotor) and whether the variance of the striatal volume is explained by the SANDI measures. They found a relationship between SANDI measures for both.

    Strengths:

    The study is both innovative and of high interest for the HD community. The authors provide a rich pool of statistical analyses and results that anticipate the questions that may emerge in the HD research community. Statistics are carefully chosen and image processing is done with state-of-the-art methods and tools. The sample size gives sufficient credibility to the findings. Altogether, I think this study sets a milestone in the attempts of the HD community to understand neuropathological processes with non-invasive methods, and extends the current knowledge of microstructural anomalies identified in HD with diffusion MRI. More importantly, the newly identified anomalies in soma size and soma density open new avenues for studying these biological effects further and perhaps developing these biomarkers for use in clinical trials.

    Weaknesses:

    (1) An important question is whether the SANDI measures, which require an expensive scanner and elaborate processing, are better biomarkers than the more traditional DTI measures. Can the authors compare the effect size of FA/MD with SANDI measures? In some of the plots and tables, FA/MD seem to have comparable, if not higher, correlations with QMotor or CAP scores. On the same vein, it is unclear whether DTI measures were included in hierarchical stepwise regression. I wonder if the stepwise models may have picked up FA/MD instead of SANDI measures if they are given a chance. Overall, I hope the authors can discuss their findings also in this light of cost vs. benefit of adopting SANDI in future studies, which is an important topic for clinical trials.

    (2) Similar to the above point, it is very important to consider how strong the biomarking signal is from SANDI measures compared to the good old striatal volume. Some plots seem to indicate that volumes still have the highest correlation with QMotor and the highest effect size in group comparisons. It would be helpful for the community to know where the new SANDI measures stand compared to the most typically used volumes in terms of effect size.

    (3) The diffusion measures are inevitably correlated to some degree. Please provide a correlation matrix in the supplementary material, including all DWI measures, to enable readers to better understand how similar SANDI measures are to each other or vs. other DTI measures. Perhaps adding volumes to this correlation matrix may also be a good future reference.

    (4) ISS stages:

    a) The online ISS calculator requires cut-offs derived from the longitudinal Freesurfer pipeline, while the authors do not have longitudinal data. Thus, the ISS classification might be inaccurate to some degree if the authors used the FS cross-sectional pipeline. Please review this issue and see if updated cut-offs should be used to classify participants.

    b) Were there really no participants with ISS 0 among the 56 HD individuals? Please clarify in the manuscript.

    (5) A note on terminology that might be confusing to some readers. According to the creators of ISS, the ISS stages are created for research only; they are not used or applied in the clinic. On the other hand, the terms "premanifest" and "manifest" have a clinical meaning, typically based on the diagnostic confidence level. The assignment of ISS0-1 to premanifest and ISS2-3 to manifest may create some non-trivial confusion, if not opposition, in some segments of the HD community. The authors can keep their current terminology, but will need to at least clarify to the reader that this assignment is speculative, does not fully match the clinically-based categories, and should not be confused with similarly named groups in the previous literature.

  9. Author response:

    Response to Reviewer 1:

    Ad (2) Clinical applications of SANDI have primarily focused on Multiple Sclerosis. However, since the preparation of the manuscript, one study has been published reporting reductions in apparent soma density and white and grey matter differences in apparent soma size in amyotrophic lateral sclerosis (ALS) (https://doi.org/10.1016/j.ejrad.2025.111981). We will include this paper in our revised manuscript.

    Responses to Reviewer 2:

    Strength:

    Ad (3) SANDI cannot directly differentiate between neural and glia cells but the pattern of differences in the SANDI parameters we observed in Huntington’s disease (HD) are consistent with the known pathology in HD.

    Weaknesses:

    Ad (1) With regards to the question about scanner and acquisition consistency, we can confirm that all diffusion data of individuals with HD and healthy controls from the WAND study were acquired with the same multi-shell High Angular Resolution Diffusion Imaging (HARDI) protocol on the 3T Connectom scanner at CUBRIC. Thus, all diffusion data analysed and reported in this manuscript were acquired with the same protocol on the same strong gradient MRI system for harmonization and consistency purposes.

    We agree that for clinical adoption it is important to demonstrate that HD-related SANDI differences do not require ultra-strong gradient imaging and can be detected on standard clinical MRI systems. While we have not collected such data in people with HD, we and others have demonstrated the feasibility of modelling SANDI metrics from multi-shell diffusion-weighted imaging data acquired with maximum b-value 3,000 s/mm2 on clinical 3T MRI system in typical adults and people with MS or ALS (https://doi.org/10.1002/hbm.26416, https://doi.org/10.1038/s41598-024-60497-6, https://doi.org/10.1016/j.ejrad.2025.111981). These studies have demonstrated that it is feasible to characterise brain microstructural differences with SANDI on clinical scanners and that comparable patterns of results can be observed across different MRI systems. It should also be noted that there is presently a move towards stronger gradient implementation in clinical systems as demonstrated by the release of the Siemens Cima.X system which will allow higher b-value diffusion scanning on clinical systems.

    ad (2) We agree that due to the small number of HD participants with HD-ISS staging the exploratory comparisons between ISS stages need to be interpreted with caution. We hope to gain access to some of the missing ISS information and plan to include these in the revised paper.

    Ad (3) With regards to the queries about the regression modelling choices:

    (1) As SANDI metrics differed between HC and HD groups, and hence may not be directly comparable, separate regression models for HC and HD data were conducted without formal comparisons between slopes. Only descriptive exploratory comparisons of the observed pattern were included.

    (2) We will provide cross-correlational analyses between all SANDI parameters in the supplements of the revised version of the paper to check for multicollinearity.

    (3)All model-based approaches, including SANDI, may be prone to model instability or parameter degeneracy and we will acknowledge and discuss this in the revised version.

    Responses to Reviewer 3:

    Weaknesses:

    Ad (1) and (2) The effect sizes (ES) of group differences in SANDI, DTI, and volume measures in the caudate and putamen (Tables 3 and 4) were broadly comparable: apparent soma radius rs (rrb = 0.45 -0.53), apparent soma size fis (rrb = 0.32 -0.45), FA (rrb = 0.38 -0.55), MD (rrb = 0.51 -0.61) and volumes (rrb = 0.49 -0.55 ). Similar ES were observed between fis and FA, and between rs and volumes. MD showed the largest ES, likely due to striatal atrophy-related CSF partial volume contamination.Cost-benefit analyses of imaging marker choices in clinical trials depend on the aim of the study. DTI provides sensitive but unspecific indices that are influenced by biological and geometrical tissue properties and capture a multitude of microstructural properties. Similarly, volumetric measurements do not inform about the underpinning neurodegenerative processes.

    With the advancement of disease-modifying therapies for HD it has become important to identify non-invasive imaging markers that can inform about the mechanistic effects of novel therapies. While DTI and volume metrics are sensitive to detect brain changes, they do not provide specific information about the underpinning tissue properties. Such information, however, may turn out to be important for the evaluation of mechanistic effects of novel therapeutics in clinical trials. Advanced microstructural models such as SANDI may help provide such information. We found that SANDI indices had statistically similar power to the gold standard measures of volumes, but with the added value of information underpinning microstructure. We and others have also shown that SANDI can be applied to multi-shell diffusion data acquired in a clinically feasible time (~10 min) on standard 3T MRI systems (please refer to our response above).

    To summarise, DTI and volumes are sensitive to brain changes but will need to be complemented by more advanced microstructural measurements such as SANDI to gain a better understanding of the underlying tissue changes and effects of disease-modifying therapies.

    Ad (3) We will provide a correlation matrix of all DWI measures in supplementary material to allow a better understanding how similar SANDI measures are to each other and compared to DTI measures.

    Ad (4) Most of the people with HD who have taken part in our study were participants in the Enroll-HD study. We will use HD-ISS information from ENROLL as much as possible. As we do not have longitudinal imaging data for all individuals classified as ISS <2, we will compare our cross-sectional striatal volumes with those from age and sex matched individuals from WAND to determine whether people fall into ISS 0 or 1 category. This approach will hopefully allow us to increase the total HD-ISS sample size and to determine whether there were participants with ISS 0 in our sample.

    Ad (5) We will explain in the revised manuscript that ISS stages are created for research only purposes and are not used or applied in clinic, while “premanifest” and “manifest” are helpful concepts in the clinical context. We will clarify that we refer to individuals without motor symptoms as assessed with Total Motor Score (TMS) as premanifest and to those with motor symptoms as manifest. This roughly corresponds to individuals at ISS 0/1 without signs of motor symptoms compared to individuals at ISS 2-3 with signs of motor symptoms.