Heart rate modulation and clinical improvement in major depression: A randomized clinical trial with accelerated intermittent theta burst stimulation
Intermittent theta burst stimulation (iTBS) is a promising noninvasive treatment for major depressive disorder (MDD), though significant variability exists in patient responses. This quadruple-blind, sham-controlled crossover randomized clinical trial investigated whether clinical improvement was influenced by the spatial selection of stimulation sites or early cardiac rhythm modulations. Seventy-five patients with MDD (33 women) were randomized to either personalized stimulation sites—identified via the strongest negative functional connectivity between the left dorsolateral prefrontal cortex and the default mode network—or fixed stimulation at the F3 position of the 10–20 EEG system. Electrocardiograms throughout stimulation monitored heart rate and its variability, focusing on heartbeat deceleration heart rate (RR) slope and changes in the root mean square of successive differences between heartbeats (RMSSD). Findings demonstrated that a higher RR slope within the first 45 s of stimulation predicted greater clinical improvement at 6 weeks, while a lower RMSSD change during the first 270 s correlated with improvement at 1 week. However, no significant differences in outcomes were observed between personalized and fixed stimulation sites. This study identified a significant relationship between early heart rhythm modulations and the antidepressant effects of accelerated iTBS in MDD. Although the spatial determination of stimulation sites did not enhance clinical improvement, iTBS-induced changes in cardiac rhythm during early sessions may develop into valuable biomarkers for stratifying patients, enabling more personalized and effective treatment strategies in the future.
Introduction
Major depressive disorder (MDD) is one of the most common psychiatric disorders and the leading cause of disability worldwide (1). With a lifetime prevalence of 16%–20%, MDD treatment is particularly challenging for approximately one-third of patients who experience treatment-resistant depression. These patients fail to respond to two or more antidepressant medication trials of adequate dose and duration (2). In this context, repetitive transcranial magnetic stimulation (rTMS) has gained attention as an effective treatment modality (3, 4). More recently, advances in stimulation protocols such as intermittent theta burst stimulation (iTBS) (5), along with the expansion of neurostimulation in clinical settings, have encouraged the application of rTMS as a first line of treatment for MDD (6, 7). Although access to and familiarity with iTBS has increased, clinical outcomes remains extremely variable and difficult to predict. In the field of precision medicine, neurobiological predictors have been proposed as potential markers of clinical improvement in patients undergoing rTMS. For example, the selection of customized stimulation sites based on individual functional connectivity and the identification of heart rhythm modulations at the onset of stimulation are very promising candidates (8–11). Thus, in this randomized clinical trial (RCT), we prospectively evaluated the clinical improvement of spatially personalized stimulation sites as well as subsequent heart rate (HR) and HR variability modulations. Both strategies suggest that the degree of communication from the site of stimulation in the left dorsolateral prefrontal cortex (DLPFC) to the subgenual anterior cingulate cortex (sgACC) is likely implicated in clinical improvement (12–14).
The most studied and successful coil position for the treatment of clinical depression has been in the left DLPFC (15). Historically, the position site was determined using the “5cm rule” or “7cm rule,” referring to the distance from the point where the resting motor threshold (RMT) was established to the prefrontal region. However, this motor system-based approach has been questioned over the years, as anatomical locations in the DLPFC have not been consistent (16, 17). More recently, the default mode network (DMN), and the sgACC in particular, has emerged as a promising references for coil positioning, given its relevance to the pathophysiology of depression and its association with clinical improvement (18–20).
One approach to optimize coil placement utilizes resting-state functional magnetic resonance imaging (MRI) connectivity of the individual. Building on evidence of a physiological anticorrelation between the frontoparietal network and the DMN (21), an rTMS study demonstrated that reducing the DMN hyperconnectivity in MDD, extending to the sgACC, was associated with symptomatic relief (18, 20). Subsequent studies have replicated an association between greater therapeutic success with rTMS and the degree of anticorrelation of the individual left DLPFC and a general sgACC seed (10). However, since a general sgACC seed may not reflect the individual sgACC site implicated in symptoms, we validated a protocol using the same anticorrelation concept, but with a focus on the strongest anticorrelation between the left DLPFC and the individual DMN. This approach demonstrated that rTMS resulted in functional disconnection of the DMN up to the sgACC region in healthy volunteers (22, 23). Translationally, we hypothesized that the closer the ideal stimulation site was to the individualized target, the greater the clinical benefit of the iTBS protocol would be for depressed patients and we tested both DMN and sgACC associations, as previously replicated.
A novel approach, termed neuro-cardiac-guided (NCG)-TMS, involves stimulating the left DLPFC and, possibly through the sgACC, exerting modulations on HR and HR variability. This is thought to reflect the successful activation of the frontal vagal pathway, which would enhance the vagal tone, reducing HR and/or increasing HR variability. Additionally, research in healthy volunteers has suggested that stimulation at the F3 position of the 10–20 EEG system is optimal for inducing early HR changes with rTMS (24). Compared to healthy controls (HCs), depressed patients exhibit altered HR and HR variability (25, 26), and these alterations are not always reversed by treatment (27). However, another study using iTBS observed a trend toward an association between HR decelerations and clinical improvement in depression (8), suggesting that HR modulations may serve as a predictor for treatment response.
In the current RCT, we rigorously tested the two previously suggested approaches to enhance the clinical benefits of rTMS treatment in depression: (1) spatial customization of coil position in the left DLPFC based on resting-state functional connectivity data and (2) early modulations of cardiac rhythm. The primary outcome was clinical improvement over a 6-week period, as assessed by the Montgomery-Åsberg Depression Rating Scale (MADRS) score in patients with major depression treated with accelerated iTBS. Based on the existing evidence, we hypothesized that both the personalized selection of stimulation sites (personalized > F3 site) and the early iTBS-induced modulations of HR (as measured by RR slope) and HR variability change [as measured by RMSSD (root mean square of successive difference) minus baseline] would be positively associated with greater clinical improvement.
Results
Of the 125 patients screened, 92 were enrolled in the study (Figure 1). Of the 33 patients who were not enrolled, 18 were excluded and 15 did not consent to participate, mainly due to time constraints. Of the excluded patients, 7 met the diagnostic criteria for bipolar depression, 3 did not meet the diagnostic criteria for MDD, 4 had claustrophobia, 3 were on recently adjusted medication, and 1 presented an incidental finding in the MRI scan. Of the 92 participants, 11 dropped out during the 6-week course of the study, with no reports of serious adverse effects (see Safety section below). The reasons for dropping out of the study were rTMS-unrelated issues (e.g., symptoms of a cold), time limitations, change in medication, etc. We also excluded 6 patients from the final analysis due to errors in the electrocardiogram (ECG) acquisition (incomplete acquisition, or extreme values detected as outliers), resulting in 75 participants.


Citation: Brain Medicine 2025; 10.61373/bm025a.0113
Characteristics of the sample
We assessed the age, sex, years of education, medication use, treatment-resistant depression, comorbidities, and expectation of effect (blinding) of the participants in total and in each group (Table 1). Except for sex (χ2(1) = 10.45, P = 0.001), no other characteristic was different between groups. The number of patients on stable medication did not differ between groups (χ2(1) = 0.03, P = 0.872).
Total | Personalized group | F3 group | ||||
---|---|---|---|---|---|---|
N (%) | M ± SD | N (%) | M ± SD | N (%) | M ± SD | |
Number of subjects | 75 (100.00) | 36 (48.00) | 39 (52.00) | |||
Order | ||||||
Active-sham | 37 (49.33) | 17 (47.22) | 20 (51.28) | |||
Sham-active | 38 (50.67) | 19 (52.78) | 19 (48.72) | |||
Sexa | ||||||
Male | 42 (56.00) 33 | 15 (41.67) 21 | 27 (69.23) 12 | |||
Female | (44.00) | (58.33) | (30.77) | |||
Age | 36.14 ± 13.09 | 33.83 ± 12.02 | 38.28 ± 13.74 | |||
Years of education | 16.42 ± 3.48 | 16.70 ± 3.74 | 16.16 ± 3.24 | |||
Medication use | ||||||
Yes | 57 (76.00) | 29 (80.56) | 28 (71.79) | |||
No | 18 (24.00) | 7 (19.44) | 11 (28.21) | |||
Medication | ||||||
Load | 1.91 ± 1.77 | 2.13 ± 1.95 | 1.71 ± 1.58 | |||
MADRSb at baseline | 24.55 ± 7.26 | 24.32 ± 7.45 | 24.79 ± 7.09 | |||
Treatment resistance | ||||||
Yes | 29 (38.67) | 18 (50.00) | 11 (28.21) | |||
No | 46 (61.33) | 18 (50.00) | 28 (71.79) | |||
Number of subjects with comorbiditiesc | ||||||
Presence of generalized anxiety | 20 (26.67) | 12 (33.33) | 8 (20.51) | |||
Panic disorder | 7 (9.33) | 3 (8.33) | 4 (10.26) | |||
Eating disorder | 3 (4.00) | - | 3 (7.69) | |||
Social phobia | 2 (2.67) | 2 (5.56) | – | |||
Obsessive compulsive | 6 (8.00) | 3 (8.33) | 3 (7.69) | |||
disorder | 2 (2.67) | 1 (2.78) | 1 (2.56) | |||
PTSDd | 2 (2.67) | 2 (5.56) | – | |||
Somatoform | 1 (1.33) | 1 (2.78) | – | |||
ADHDe | 1 (1.33) | 1 (2.78) | – | |||
Expectation of effect (VAS 0-1) | ||||||
Active | 0.43 ± 0.29 | 0.45 ± 0.30 | 0.41 ± 0.28 | |||
Sham | 0.38 ± 0.29 | 0.39 ± 0.29 | 0.38 ± 0.30 |
aP < 0.001.
bMADRS: Montgomery-Åsberg Depression Rating Scale.
cComorbidities (χ2(1) = 0.8, P = 0.371): not included as covariates.
dPTSD: posttraumatic stress disorder.
eADHD: attention-deficit hyperactivity disorder.
Safety and side effects
Of the 75 participants, 53.33% reported headache, 30.00% neck pain, 58.00% scalp pain, and 20.67% scalp irritation at the stimulation site. The intensity of daily pain/irritation experienced (Likert 0–10) was, however, very low on average in the active condition [M(SD): 1.47 (1.19) headache, 1.31 (1.35) neck pain, 2.31 (1.73) scalp pain, 1.40 (1.03) scalp irritation] as well as in the sham condition [M(SD): 1.01 (0.99) headache, 1.24 (1.32) neck pain, 1.06 (0.90) scalp pain, 0.81 (0.55) scalp irritation]. The intensity of daily pain/irritation between conditions differed (P < 0.0125) only for headache (W = 3663.5 P = 0.007 for headache, W = 2688 P = 0.564 neck pain, W = 3369.5 P = 0.029 scalp pain, W = 2744.5 P = 0.718 scalp irritation). No participant reported serious adverse effects.
Personalized versus F3 stimulation
We found no significant interaction effect of time and sex in the two groups [F(4,295) = 0.733, P = 0.57]. Furthermore, no two-way interactions were significant. There was a significant effect of time [F(4,295) = 14.378, P < 0.001]. In the pairwise comparisons, a significant reduction of MADRS was found in F3 from V1 to all visits except V3, whereas in the personalized, only V6 showed a significant reduction compared to V1 after Bonferroni correction. Most importantly, patients in the F3 when compared to the personalized stimulation displayed no significant difference in MADRS scores at the first as well as the last visit (pairwise comparison P = 0.494, P = 0.490) (Figure 2, panel 1). Therefore, there was a significant reduction in MADRS scores in both groups, but no significant difference between personalized and standard F3 stimulation was found in the V6 evaluation. Finally, although difficult to visualize, the within-week reductions in the overall MADRS score in the active condition were significantly larger than the sham in the linear mixed model (LMM) (Table 2), as seen earlier in (28).


Citation: Brain Medicine 2025; 10.61373/bm025a.0113
Delta MADRS scorea | |||
---|---|---|---|
Predictors | Estimates | CI | P |
(Intercept) | −0.158 | −0.372 to 0.055 | 0.145 |
condition [active] | 0.145 | 0.029–0.261 | 0.015b |
RMSSDc change 270 s | 0.009 | −0.002 to 0.019 | 0.096 |
distance to optimal stimulation site | 0.003 | −0.004 to 0.010 | 0.385 |
stimulation order [sham_first] | 0.054 | −0.064 to 0.172 | 0.370 |
Age | 0.005 | 0.001–0.010 | 0.024b |
Sex [men] | 0.027 | −0.095 to 0.149 | 0.659 |
Medload | −0.017 | −0.053 to 0.019 | 0.363 |
TRDd | 0.002 | −0.125 to 0.128 | 0.980 |
Condition [active]*RMSSD change 270 s | −0.017 | −0.029 to -0.004 | 0.008e |
Random effects | |||
σ2 | 0.12 | ||
τ00 IDnew | 0.00 | ||
N IDnew | 75 | ||
Observations | 143 | ||
Marginal R2/Conditional R2 | 0.139 / NA |
a1 being a 100% reduction of depressive symptoms.
bP < 0.05.
cRMSSD: root mean square of successive differences between heartbeats.
dTRD: treatment-resistant depression.
eP < 0.01.
Next, we investigated whether the MADRS score changed according to the distance between the stimulated and ideal sites. The mean Euclidean distance between ideal and stimulated sites was obviously shorter (W = 4826, P < 0.001) in the personalized (M ± SD: 7.27 ± 4.76 mm) as compared to the F3 (M ± SD: 18.20 ± 9.17 mm) group. However, the correlation values for active (r = 0.17, P = 0.14) and sham (r = 0.08, P = 0.48) conditions did not hint toward any relationship (Figure 2, panel 2). In addition, we projected the stimulation sites from the native to the standard space in order to visualize the spatial distribution of the areas in personalized and F3 groups coded with clinical response ranges (Figure 2, panel 3).
Heart rate and heart rate variability
We assessed whether iTBS-driven modulations in the RR interval (45 s) and RMSSD (270 s) would be associated with clinical benefit (delta MADRS score: 1 being a 100% reduction of depressive symptoms). Therefore, we correlated slopes and delta RMSSD with MADRS change and found a significant positive relationship for slope in the active condition only (r = 0.27, P = 0.021; Figure 3A), which was also significantly higher than the sham condition [t(74) = 2.92, P = 0.005, d = 0.41 (CI: 0.12 – 0.69); Figure 3B]. A relationship was not seen for correlation of delta MADRS score and RMSSD change in the active (r = − 0.17, P = 0.15) or sham (r = 0.05, P = 0.7) conditions. When assessing the delta MADRS within a week, we identified a significant negative correlation with RMSSD change in the active condition only (r = − 0.29, P = 0.013; Figure 3C), which was also significantly higher than the sham condition (W = 3402, P = 0.017) (Figure 3D). To confirm that the hypothesized timeframes were correct, we also conducted an exploratory investigation of timeframes 30 s, 45 s, 60 s, 180 s, 270 s, and 360 s to cover time regularly up to 6 min (Tables S1, S2, and S3; Figures S1 and S2). It is also important to note that no differences in HR or HRV were observed across any time intervals between the personalized stimulation group and the F3 group (Table S4).


Citation: Brain Medicine 2025; 10.61373/bm025a.0113
Additionally, we performed LMM to explore the within-week effect of iTBS on delta MADRS score, which, in the context of hypothesis-generating, was not controlled for multiple testing. The first set of models with the three-way interaction between the condition, the distance, and the RR slope yielded a significant full-null comparison at 45 s [χ2(6) = 13.09, P = 0.042]. At all other timeframes (30 s, 60 s, 180 s, 270 s, 360 s), the full model was not significantly different from the null model. We consequently calculated the models for 45 s only. The three-way interaction was not significant (P > 0.05). We decided to reduce the model to the two-way interactions between the condition and the distance, the condition and the slope, and the distance and the slope. Again, we did not observe any significant interaction. We further reduced the 45 s model to the main effects and observed a significant effect for the age predictor (β = 0.006, P = 0.012). As the full-null comparisons did not yield any significant differences (P > 0.05), we could not replicate this finding with our secondary outcome measures [Hamilton Depression Rating Scale (HAMD) and Beck Depression Inventory (BDI-II]) as the dependent variables.
The second set of models with the three-way interaction between the condition, the distance, and the RMSSD difference score was significantly different from the null model at all timeframes [χ2(6) = 12.70, P = 0.048 (30 s), χ2(6) = 14.83, P = 0.021 (45 s), χ2(6) = 14.19, P = 0.028 (60 s), χ2(6) = 13.81, P = 0.032 (180 s), χ2(6) = 17.58, P = 0.007 (270 s), and χ2(6) = 24.67, P = 0.0004 (360 s)]. Still, the three-way interaction was not significant (P > 0.05). We reduced the models by removing the insignificant interactions between the condition and the distance, and between the RMSSD change and the distance. In doing so, the two-way interaction was significant in the models for 180 s (β = −0.015, P = 0.011) and for 270 s (β = −0.017, P = 0.008, Table 2). We saw a trend in the model for 60 s (β = −0.01, P = 0.075). These estimated fixed effects were all negative, indicating that a greater iTBS-specific change in RMSSD led to worse treatment outcome in our sample. Regarding our secondary outcomes, none of the full null comparisons were significant in HAMD. The set of models with BDI-II was significantly different from the null model at 180 s [χ2(6) = 22.31, P = 0.001], 270 s [χ2(6) = 19.76, P = 0.003], and 360 s [χ2(6) = 21.19, P = 0.002]. The three-way interactions were all insignificant. However, the two-way interactions between RMSSD change and active condition were significant for 180 s (β = −0.022, P < 0.0001), 270 s (β = −0.020, P = 0.002), and 360 s (β = −0.022, P = 0.001).
Discussion
In this RCT, we investigated whether the clinical benefit of accelerated iTBS for treating depression could be related to personalization of left DLPFC stimulation sites and/or early heart rhythm modulations. As the largest prospective study to date to address these aspects, our findings show that clinical improvement driven by iTBS was associated with modulations of HR and HR variability—but not with personalized stimulation sites based on individual resting-state connectivity. As hypothesized, iTBS-driven modulations of HR were significantly associated with clinical improvement in depression. Specifically, greater HR deceleration (RR slope) within the first 45 s of the first stimulation day, the greater the clinical benefit, reflected in MADRS score changes up to 6 weeks of follow-up. Although iTBS-induced HR variability changes (RMSSD) within the first 270 s did not correlate with MADRS score change at 6 weeks, they did predict clinical benefit at a shorter follow-up (1 week), with patients showing lower RMSSD change experiencing greater MADRS score reductions. However, no clinical benefit was observed from using personalized stimulation sites based on individual functional connectivity (personalized site) compared to fixed stimulation sites (F3 site), nor was a relationship found between the distance from ideal sites on the left DLPFC and clinical improvement.
There is considerable anticipation regarding the identification of methods that maximize the therapeutic effect of rTMS (29, 30). The NCG-TMS approach suggests that when the frontal vagal pathways is subjected to high-frequency stimulation, HR deceleration can be observed, as demonstrated in healthy volunteers (24). In the context of MDD, often characterized by elevated HR (31), the HR deceleration observed early in the stimulation protocol could serve as objective evidence for potential effectiveness of the treatment. If this modulation in certain individuals is reactive, it possibly indicates successful activation of the frontal vagal pathway, thereby providing a means to predict which patients will benefit from the intervention. Our results support observation of increased RR slopes under real versus sham iTBS conditions, along with significant correlations between RR slopes and treatment improvement—findings that were previously only observed as trends (8). In fact, this approach has been regarded as so promising that some researchers have suggested it could replace the use of motor output to determine individual stimulation intensity and/or coil positioning in patients with MDD (32). While the use of motor output to determine individual stimulation intensity is a well-established and valuable method for standardization, a clear link between motor cortex action potential thresholds and the left DLPFC remains uncertain. This raises concerns about the accuracy of transferring motor output intensity information to the level required for therapeutic effects. In other words, a model that considers the individual anatomy and function of the DLPFC would be more accurate, as the intensity determined in the motor cortex is dependent on its spatial characteristics, which may differ significantly from those of therapeutic targets. Therefore, a method grounded in the underlying mechanism by which appropriate stimulation of the left DLPFC probably targets the sgACC and triggers cardiac modulations appears highly promising for future clinical applications.
In addition, a meta-analysis identified lower RMSSD as a strong pathological finding in MDD in comparison to HC (33). Generally, HR variability is considered an indicator of well-being (34) and self-regulatory ability (12, 35), both of which are reduced in individuals with depression (33, 36). In this sense, an increase in RMSSD during iTBS could indicate greater frontal vagal modulations, from the DLPFC, through the sgACC to the brainstem and vagus nerve, reaching the heart (29), and thus serving as a potential predictor of clinical improvement. However, to the best of our knowledge, this mechanism has not been demonstrated in previous studies. While we were able to replicate the increase in RMSSD in active versus sham iTBS (8, 37), our exploratory results revealed an unexpected negative relationship. We found that the smaller the change in RMSSD, the greater the reduction in MADRS score. It should be noted that RMSSD assessed here likely reflects stimulation-induced, transient heart–brain coupling effects (8, 38), rather than resting-state RMSSD as an index of baseline autonomic tone. These findings, derived from a LMM, show a significant two-way interaction between the iTBS condition and RMSSD change in predicting delta MADRS score. To further investigate these early heart rhythm modulations, we conducted additional analyses. A direct comparison of baseline MADRS scores between two subgroups, divided by the median RMSSD, showed no significant differences (P > 0.05). This suggests that our results were not influenced by potential flooring effects (e.g., less depressed individuals with higher RMSSD having less room for improvement compared to more depressed subjects). Based on the negative association, we hypothesized that effective engagement of the frontal vagal network would initially result in less HR variability during stimulation, followed by increased HR variability afterward as a compensatory mechanism, aligning with clinical improvements. We further examined whether participants who improved clinically were also those who exhibited higher RR slopes and lower RMSSD changes at different time points. Of the 10 participants with the most significant changes, only 2 showed the greatest clinical improvement, indicating that more complex mechanisms may underlie these modulations. Overall, our results underscore the complexity of RMSSD as an output (39), suggesting that systematic investigation is needed to better delineate the brain-heart pathways and potentially achieve even higher predictive accuracy (8) than that observed with RR slopes alone.
Equally important, regarding clinical improvement, interindividual variability could potentially be reduced, and the effect of iTBS maximized, by optimizing stimulation sites based on individual functional connectivity data. The sgACC has been proposed as a key marker for treatment response in depression (9, 20), and several studies have suggested that stimulation sites with the strongest negative connectivity to the sgACC could yield superior clinical outcomes (10, 13, 14). This method has been widely replicated and is considered the most established approach in the current literature for identifying patients who may benefit from rTMS. However, in our study, the use of functional connectivity data for personalized site selection did not show a clinical advantage. When comparing personalized stimulation sites on the left DLPFC to the fixed F3 stimulation site, no significant difference in MADRS score changes at 6 weeks of follow-up was observed. Additionally, no association between the distance from the ideal stimulation site and clinical improvement was identified. An inspection of the data revealed no spatial pattern of stimulation in the left DLPFC that suggested a clinical advantage. Importantly, because the point on the left DLPFC with the strongest anticorrelation to each subject's DMN differs from the point with the strongest anticorrelation to a general sgACC seed, we repeated the analysis as described in previous studies (10). Again, no relationship between the distance from the ideal site and clinical improvement at 6 weeks was found (Figure 2) with accelerated iTBS. This finding contradicts the results of prior studies and may suggest that the profile of the MDD sample evaluated plays a critical role. Given that our study included a larger sample than those in previous studies showing positive results (9, 10), a sample size limitation is unlikely to explain the discrepancy. We also explored the changes in stimulation conditions within 1 week of the intervention but again found no evidence of a relationship. It is possible, however, that subgroups, such as biotypes (40, 41), may respond differently to this personalized approach, leading to better outcomes. Nevertheless, as it stands, we could not demonstrate any clinical benefit from the implementation of this personalization method. Our results align with Morriss et al., Our findings align with Morriss et al. (42); while they compared resting state functional MRI (rsfMRI)- and structural MRI-guided protocols without finding differences in clinical benefit, we compared personalized targeting with the F3 position of the international 10–20 EEG system—an approach that is simpler and more feasible in clinical practice. Similarly, Elbau et al. (43) reported only a weak association between sgACC–DLPFC connectivity and treatment response, highlighting the limited utility of such measures in larger, representative samples. Overall, it must be acknowledged that using resting-state functional connectivity data as the basis for this precision method was labor-intensive, expensive, and did not reveal a direct clinical advantage in this RCT. Future novel approaches (44) may provide new insights into this challenge.
Regarding the secondary outcomes (BDI-II and HAMD), a previous study found a negative correlation between baseline RMSSD and baseline HAMD-24 (43), which was not observed in our sample using the HAMD-17. Other findings could not be replicated, except for the two-way interactions between RMSSD change and the active condition in delta BDI-II score LMMs.
The present study has several limitations. The heterogeneity of depression, due to variations in presenting symptoms, is well documented, prompting efforts to identify subtypes and their underlying neural correlates (40, 41, 45, 46). Due to limitations in sample size and insufficient representation of MDD clinical variations, we were unable to explore these important aspects. This should be a focus for future studies with larger and more diverse datasets. Additionally, we found no relationship between RR slopes or RMSSD changes and delta BDI-II or HAMD scores. This may be explained by the factor analytic approach of Uher et al. (47), who showed that the MADRS, BDI-II, and HAMD items load differently on three factors: the MADRS mainly on the observed mood factor, the BDI-II on the cognitive factor, and the HAMD on the neurovegetative factor. Given the dimensional complexity of MDD, it is not surprising that clinical improvement is not captured equally across all depression scales. Personalized connectivity-based targeting is highly dependent on the specific methods of data acquisition, analysis, and implementation; therefore, our results cannot be directly generalized to all targeting protocols. It should also be noted that in the personalized group, a small number of stimulated sites deviated by more than 10 mm from the calculated connectivity-based target. Such deviations are inherent to the practical implementation of neuronavigated targeting, where factors such as coil positioning, individual anatomy, and patient tolerance may limit exact placement. V6 corresponds to approximately 5 weeks after the start of active stimulation in the active-sham group and approximately 3 weeks in the sham-active group, due to the crossover design. This design was chosen as no alternative approach with comparable rigor and power exists to quantify sham effects. Both F3 and personalized groups were balanced across these intervals, and averaging over the full course provides a conservative estimate, avoiding overestimation of intervention effects. Nonetheless, the crossover design complicates the separation of long-term effects between iTBS conditions due to potential carryover (48). While placebo effects were not sustained, they were robust, indicating that the rTMS setup may enhance expectation effects (28, 49, 50), which future studies should carefully control for.
In conclusion, we demonstrated an association between HR deceleration during the first 45 s of stimulation and the degree of reduction in depressive symptoms up to 6 weeks after the aiTBS protocol. However, this relationship was not observed in HR variability modulation at 270 s, which correlated with clinical improvement only within the stimulation week, presenting important predictive limitations. Moreover, clinical improvement was not related to the distance between stimulation and ideal sites, and no benefit was observed between the personalized and F3 groups. In contrast, the use of online ECG emerges as a promising and cost-effective approach that could be further developed for MDD treatment with iTBS. Overall, the promise of precision medicine may be realized through improved targeted stimulation using a combination of neuronavigation and ECG, which could allow for the stratification of patients more likely to benefit from iTBS.
Methods
Participants and study protocol
We selected 125 depressed patients between April 2019 and July 2021, recruited through national advertisements, to participate in an rTMS study. Inclusion criteria were age between 18 and 60 years, diagnosis of MDD confirmed on site by medical assessment using the Structured Clinical Interview for DSM-5 Disorders-Clinician Version (SCID-5-CV), and moderate-to-severe depressive episode, according to MADRS score. The intervention was on an outpatient basis, although both inpatient and outpatient participants were included. Exclusion criteria were epilepsy or other neurological diseases, pregnancy, metallic implants, claustrophobia, as contraindications to MRI or rTMS, or, in case of regular psychopharmacological medication intake, change of type or dosage in the 2 weeks prior to participation or during the course of the study. To account for the psychotropic medication load (51–55), we summed for each individual the estimated intake dose of each substance at each visit, based on the interview or when possible on the circulating blood levels (below = 1, normal = 2, or above = 3 of the expected dose). The study protocol (clinicaltrials.gov/ct2/show/NCT05260086) is in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of the University Medical Center Göttingen (UMG). Prior to enrollment, verbal and written informed consent was obtained from all participants.
Once enrolled, the participant was randomized to participate in a 6-week quadruple-blind (patient, stimulation provider, principle investigator, clinical rater) crossover trial. We used the randperm function in MATLAB (The MathWorks, Inc., Natick, MA, USA) to generate a true random sequence that coded if the participant would receive the stimulation either at the F3 position of the international 10–20 EEG system (F3 group) or at personalized left DLPFC sites (personalized group) (Figure 1A), and either as active-sham or sham-active conditions (Figure 1B). At baseline and at the end of every week, a trained rater administered the MADRS (56) as a primary outcome measure, and additionally the Beck Depression Inventory (BDI-II) (57) and the Hamilton Depression Rating Scale (HAMD) (58) as secondary outcome measures.
Magnetic resonance imaging
We collected structural data with T1-weighted scans (1 × 1 × 1 mm) with a 64-channel head coil in a 3T MR scanner (MAGNETOM Prisma, Siemens Healthcare, Erlangen, Germany). We acquired the rsfMRI data using T2*-weighed gradient-echo echoplanar imaging with the following parameters (59, 60): repetition time of 1.5 s, echo time of 30 ms, flip angle of 70°, 69 axial slices with multiband factor of 3, 2 × 2 × 2 mm, field of view (FOV) of 189 mm, with 10% interval between slices and posterior to anterior phase encoding. The rsfMRI data were acquired with 400 volumes in 10 minutes. The gradient-echo field map was acquired with a repetition time of 704 ms, echo times of 4.92 ms (TE 1) and 7.38 ms (TE 2), flip angle of 60°, 73 slices, FOV of 210 mm, 2 × 2 × 2 mm, with a 10% interval between cuts and phase encoding from anterior to posterior.
Ideal and stimulated cortical sites
In accordance with prior publications (22, 23), we temporally concatenated the data for group independent component analysis (GICA). The GICA was performed with FSL 5.0.7 software (61) separately for the scans of the first, second, third, and fourth week, and separately for the F3 and personalized group. We identified components that best represented the DMN and the network that best covered the left DLPFC, and back reconstructed these into the normalized rsfMRI data of the individual patients. We then identified the left DLFPC region that most strongly anticorrelated to the DMN of the subject (ideal site) and projected the (x, y, z) coordinates from normalized to native space. Noteworthy, our method to determine the strongest left DLPFC anticorrelation to the DMN in each participant differs from a commonly used approach in which the area of strongest anticorrelation is found from seed-based connectivity analyses using a common sgACC seed derived from HCs (10). Therefore, we also extended the analysis to the seed method to explore whether this approach would yield different results.
Our neuronavigation systems (Visor 2 software, ANT Neuro, Enschede, Netherlands) recorded the stimulation coordinates in each session. We calculated the average of the first coordinate of each session in the week (x, y, z) and centered a region of interest (ROI) on it with spheres of 2 mm radius using MarsBar (62). To find the point on the cortex closest to the ROI on the skull, we calculated the Euclidean distance between the segmented cortex of each individual T1-weighted image and the center of the ROI using SPM12 and MATLAB. Finally, we calculated the Euclidean distance between the ideal cortical site and the stimulated cortical site for each observation and used this variable (“distance”) in further analyses.
ECG acquisition and analysis
Participants were sitting quietly in a reclined chair for at least 10 min before recordings started. We acquired ECG data 2 min before (baseline) and throughout iTBS using the three electrodes of the neuroConn NCG-rTMS device (neuroCare, Munich, Germany)—ground electrode placed near the lower end of the sternum, red electrode placed on the first intercostal space on the right side, green electrode placed on the fourth or fifth intercostal space on the left side. The device recorded the ECG at 1000 Hz as well as the iTBS bursts, allowing for exact synchronization of ECG and iTBS information. We analyzed the ECG data using the Biosignal Processing in Python (BioSPPy) toolbox, the heart rate variability (HRV) analysis toolbox, and the SciPy module (63) in Python 3.6. We used SciPy to remove power-line noise from the ECG signal and then used BioSPPy to remove band wander noise in the data and detect R peaks of the QRS complex as the maxima of the curve ± 25 ms. Then, we calculated the RR interval using the timestamps of detected R-peaks. We then preprocessed the RR interval by developing a Python correction of ectopic beats and other types of outliers as detailed in Lipponen and Tarvainen (2019) (64). Next, we selected the timeframe of 45 s from the start of iTBS for the RR interval, as the HR deceleration (represented by a positive slope) can be seen stronger in comparison to the sham condition (8). We also selected the root mean square of successive differences (RMSSD) in the time domain as an index of HRV due to being consistent and presenting the largest difference between MDD and HC among HRV parameters (33). The RMSSD is indicative of the parasympathetic influence of the autonomous nervous system on HR and lower values of RMSSD suggest lower parasympathetic (vagal) activity (65). Moreover, delta RMSSD showed significant decrease during cognitive stress (66). Based on criticism against ultra-short-term HRV and the recommendation of at least 4 min to draw accurate conclusions about HRV (67, 68), we selected the RMSSD timeframe of 270 s after stimulation starts. Importantly, RMSSD during stimulation likely reflects rTMS-induced entrainment of cardiac rhythm, also referred to as Heart–Brain Coupling (8, 38, 69), rather than resting-state HRV.
Accelerated intermittent theta burst and sham stimulations
Stimulation sessions took place during week 2 and week 4. We used a MagVenture X100 with Mag-option and a figure-of-eight MCF-B65 A/P cooled coil. The resting motor threshold was reassessed daily via electromyography in the right first dorsal interosseous muscle with the lowest intensity eliciting 50 mV motor reaction in 5 of 10 attempts, and the stimulation intensity was adjusted to 110%. In the active condition, we applied the stimulation parameters (4): stimulation ON lasted for 2 s, followed by 8 s of stimulation OFF, a volley composed of 10 units of three individual bursts with a frequency of 5 Hz and each burst with three single pulses with 50 Hz, performed 60 times, resulting in 1800 per session (7200 pulses daily and 36,000 pulses in total). In the sham condition, the same MCF-B65 A/P coil was blindly rotated 180 degrees, according to precoded sequences provided by the study principal investigator, who had no contact with the participants. Each daily visit included four sessions (accelerated protocol) of approximately 10 minutes per session, followed by at least 20-min pauses, according to the necessary intersession time described on previous literature (69). As part of the device, transcutaneous electrical nerve stimulation (tENS) electrodes placed in the scalp as close as possible to the coil mimicked tingling skin sensations simultaneously to the sound of pulses, regardless of the stimulation condition. After completion of all stimulation sessions, participants were asked to indicate their expectation about the effectiveness of the stimulation using a visual analog scale (VAS). As there was no significant difference between expectation levels under stimulation conditions [t (70) = 1.47, P = 0.15, see Table 1], we see the blinding procedure as effective.
Statistical analysis
Statistical analyses were conducted in RStudio (38), adopting a significance level of two-sided α = 0.05. Participants were excluded from the analysis, if the baseline MADRS score < 7 or the z-standardized outcome score > ± 2.5 SD. Levene's test was used to check the homogeneity of variance assumption and Mauchly's test was used to check the sphericity of variances. To investigate sample characteristics in the F3 and personalized groups, we used nonparametric Wilcoxon rank-sum (W) and chi-square (χ2) tests. For investigating the MADRS scores and the interaction between groups over time, we conducted repeated mixed measures ANOVA (F) with the time as the within-subject factor and F3 or personalized groups as a between-subjects factor. Due to significant differences between groups in sex distribution, we added sex as an additional between-subjects factor.
We conducted independent analyses of Wilcoxon rank-sum (W) test between the F3 and personalized groups to investigate the Euclidean distance. We further computed Pearson correlation coefficients between the distance and delta MADRS as well as between HR/HRV and delta MADRS. By conducting dependent t test (t) or Wilcoxon rank-sum (W) test, if the assumption of normality in the data is violated, we compared the ECG measures between the active and the sham condition. We calculated Cohen's d and the 95% confidence interval as effect size within and between groups and conditions. An effect size of d > 0.10 indicates a small, an effect size of d > 0.30 a medium, and an effect size of d > 0.50 a large effect (71). Delta MADRS score was computed as 1 – poststimulation score/prestimulation score. The primary outcome was delta MADRS score within 6 weeks, but we also investigated delta MADRS score within each stimulation week as an attempt to unravel possible carryover effects. For the later, we used the R packages lme4 (72) and lmerTest (73) tofit the LMMs using the restricted maximum likelihood method and to extract P values. We performed full-null comparisons using the chi-square (χ2) test of our hypothesized full model and a corresponding null model with a random structure. To minimize the accumulation of α error, we only extracted and interpreted the estimated fixed effects when the full model was significantly different from the null model.
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgments
We thank the study participants and the colleagues from the Laboratory of Systems Neuroscience and Imaging in Psychiatry, the Department of Psychiatry and Psychotherapy, and the Department of Cognitive Neurology of the University Medical Center Göttingen. In particular, we are grateful for the great dedication of Ms. Ilona Pfahlert and Ms. Britta Perl. We thank Dr. Roger Mundry from the Leibniz ScienceCampus Primate Cognition (https://www.primate-cognition.eu/en) for statistical support. We also thank Mr. Klaus Schellhorn, Dr. Farnoosh Safavi, and Mr. Matthias Kienle on behalf of customer service at neuroConn, ANTNeuro, and MagVenture, respectively. It is important to note that customer service had no influence on the work described here.
Author contributions
All listed authors made substantial contributions to various aspects of the work. RGM funded and designed all experiments, oversaw the entire data collection and wrote the manuscript. JW and HMJ analyzed all data, made all figures, and wrote the manuscript. IA, VB, PD, LE, NH, VK, LEM, YM, CR, KS, AS, FW; JW, AW, VZ, and AZ made significant contributions to the acquisition and interpretation of data. MW, CSS, and TEG made substantial contributions to the analysis and interpretation of data and substantively revised the manuscript. TEG also collected data and oversaw the entire data collection. The manuscript has been read and approved by all authors. All authors take full responsibility for all data, figures, and text and approve the content and submission of the study. No related work is under consideration elsewhere.
Funding sources
This work was supported by the German Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung, BMBF: 01 ZX 1507, ‘‘PreNeSt - e:Med’’). Jonas Wilkening was supported by the Göttingen Promotionskolleg für Medizinstudierende, funded by the Jacob-Henle-Programm/Else-Kröner-Fresenius-Stiftung and Jens Wiltfang is supported by an Ilídio Pinho Professorship, iBiMED (UID/BIM/04501/2013) and FCT project PTDC/DTP-PIC/5587/2014 at the University of Aveiro, Portugal.
Author disclosures
The authors declare no conflicts of interest. Two NCG-ENGAGE HR boxes were provided on loan by neuroCare for use in this study; however, they had no knowledge of or influence on the results obtained. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication. The manuscript has been read and approved by all authors.
Supporting Online Material

Participants and study protocol. (A) Study flow diagram and (B) study design.

MADRS scores, stimulation site distance, and spatial distribution in F3 and personalized groups. Panel 1: Montgomery–Åsberg Depression Scale (MADRS) score from weekly visits V1 to V6 with active or sham stimulation between V1-V2 and V3-V4 visits (A) in F3 and personalized groups and (B) in F3 and personalized groups with sham-active and active-sham subgroups. Error bars represent mean ± SE. Panel 2: Relationship between clinical improvement (delta MADRS) and distance to ideal stimulation site calculated by (A,C) the default mode network (22, 23) and (B,D) the subgenual anterior cingulate cortex (9–11) methods. F3 stimulation in red (dark red: active, light red: sham), personalized stimulation in green (dark green: active, light green: sham); r indicates Pearson's correlation coefficient. Panel 3: Individual stimulation sites in native space projected to the MNI space in (A) whole sample, (B) personalized, and (C) F3 groups. Points are color-coded according to MADRS score 1 – (post/pre) within 6 weeks. Note the absence of a clear spatial pattern of color distribution. Also of note, projection to MNI space involves warping, which may limit the precision of individual native-space reconstructions.

Stimulation-induced the RR interval (first 45 s) and RMSSD (first 270 s) modulations along with the corresponding Montgomery–Åsberg Depression Scale (MADRS) score changes. (A) The association between the RR slope and delta MADRS score within 6 weeks was seen in the active condition (red) only, and (B) the RR slope was stronger in active (red) versus sham (gray) conditions. (C) The association between the RMMSD change and delta MADRS score within 1 week was seen in the active condition (red) only, and (D) the RMMSD change was stronger in active (red) versus sham (gray) within 1 week. Error bars represent mean ± SE. *P < 0.05, **P < 0.01.
Contributor Notes
Publisher's note: Genomic Press maintains a position of impartiality and neutrality regarding territorial assertions represented in published materials and affiliations of institutional nature. As such, we will use the affiliations provided by the authors, without editing them. Such use simply reflects what the authors submitted to us and it does not indicate that Genomic Press supports any type of territorial assertions.