TY - JOUR
T1 - A two-stage disentangled and balanced representation learning method for counterfactual regression
AU - Wang, Siyi
AU - Huang, Yiyan
AU - Leung, Cheuk Hang
AU - Wang, Chaoqun
AU - Wu, Qi
N1 - Publisher Copyright:
© 2025 Elsevier Inc.
PY - 2026/3/25
Y1 - 2026/3/25
N2 - Estimating ITEs is crucial for personalized decision-making. Prior research has sought to estimate ITEs using the causal representation balancing method, where all features are mapped to an embedding space for the tasks of outcome modeling and representation balancing. Nevertheless, two challenges remain: (i) the potential adverse interactions between the two tasks due to their sharing of the same representation of confounders and (ii) the negative impact of instrumental variables on outcome predictions. These observations motivate us to disentangle instrumental, confounding, and outcome-specific factors according to their causal relationships with the treatment and outcome within the representation balancing framework. We propose an ITE error bound using confounding and outcome-specific factors, showing that representation balancing through confounding factors alone is sufficient. Building upon this ITE error bound, we introduce a two-stage method, Disentangled and Balanced Representation Learning for Counterfactual Regression (DBR-CFR), to estimate ITEs. In the first stage, DBR-CFR effectively disentangles observed pre-treatment variables into three desired factors. In the second stage, DBR-CFR minimizes the objective function based on the empirical ITE error bound using disentangled factors. Extensive experimental results on synthetic and benchmark datasets verify the effectiveness and superiority of our proposed method. The source code is available at https://github.com/sssywang/DBR-CFR.
AB - Estimating ITEs is crucial for personalized decision-making. Prior research has sought to estimate ITEs using the causal representation balancing method, where all features are mapped to an embedding space for the tasks of outcome modeling and representation balancing. Nevertheless, two challenges remain: (i) the potential adverse interactions between the two tasks due to their sharing of the same representation of confounders and (ii) the negative impact of instrumental variables on outcome predictions. These observations motivate us to disentangle instrumental, confounding, and outcome-specific factors according to their causal relationships with the treatment and outcome within the representation balancing framework. We propose an ITE error bound using confounding and outcome-specific factors, showing that representation balancing through confounding factors alone is sufficient. Building upon this ITE error bound, we introduce a two-stage method, Disentangled and Balanced Representation Learning for Counterfactual Regression (DBR-CFR), to estimate ITEs. In the first stage, DBR-CFR effectively disentangles observed pre-treatment variables into three desired factors. In the second stage, DBR-CFR minimizes the objective function based on the empirical ITE error bound using disentangled factors. Extensive experimental results on synthetic and benchmark datasets verify the effectiveness and superiority of our proposed method. The source code is available at https://github.com/sssywang/DBR-CFR.
KW - Counterfactual regression
KW - Disentangled representations
KW - Individual treatment effects (ITEs)
KW - Representation learning
UR - https://www.scopus.com/pages/publications/105022905223
U2 - 10.1016/j.ins.2025.122886
DO - 10.1016/j.ins.2025.122886
M3 - Article
AN - SCOPUS:105022905223
SN - 0020-0255
VL - 730
JO - Information Sciences
JF - Information Sciences
M1 - 122886
ER -