TY - JOUR
T1 - Sentiment-devoid lexicons
T2 - A novel method for domain-specific textual analysis in business and governance documents
AU - Ma, Wentao
AU - Ho, Shuk Ying
N1 - Publisher Copyright: © 2024
PY - 2025/1
Y1 - 2025/1
N2 - Our study proposes and tests a method for developing domain-specific dictionaries tailored for textual analysis in information systems research. Traditionally, dictionaries have been widely used for content classification according to sentiment; however, we introduce an alternative approach focused on creating dictionaries from sentiment-devoid documents. We demonstrate this method by developing a dictionary specific to Securities and Exchange Commission (SEC) investigations. Analyzing 150,432 publicly available SEC documents, we gained insights into the semantics of communications between the SEC and firms. To evaluate the dictionary, we analyzed SEC comment letters to predict the likelihood of firms reporting information technology control weaknesses (ITCWs), information technology audit fees, and cyber risks. Our dictionary outperformed five benchmarking dictionaries, explaining a higher proportion of variance in ITCW likelihood, information technology audit fees, and cyber risks. This study enhances the effectiveness of dictionaries in analyzing sentiment-devoid business and governance documents and results in a specialized dictionary for SEC communications.
KW - Comment letters
KW - Dictionary
KW - Information technology control weaknesses
KW - SEC reviews
KW - Sentiment-devoid
KW - Textual analysis
UR - http://www.scopus.com/inward/record.url?scp=85209949312&partnerID=8YFLogxK
U2 - 10.1016/j.im.2024.104055
DO - 10.1016/j.im.2024.104055
M3 - Article
AN - SCOPUS:85209949312
SN - 0378-7206
VL - 62
JO - Information and Management
JF - Information and Management
IS - 1
M1 - 104055
ER -