Quantitative approaches to content analysis: Identifying conceptual drift across publication outlets

Marta Indulska, Dirk S. Hovorka, Jan Recker

Research output: Contribution to journalArticleResearchpeer-review

48 Citations (Scopus)
13 Downloads (Pure)

Abstract

Unstructured text data, such as emails, blogs, contracts, academic publications, organizational documents, transcribed interviews, and even tweets, are important sources of data in Information Systems research. Various forms of qualitative analysis of the content of these data exist and have revealed important insights. Yet, to date, these analyses have been hampered by limitations of human coding of large data sets, and by bias due to human interpretation. In this paper, we compare and combine two quantitative analysis techniques to demonstrate the capabilities of computational analysis for content analysis of unstructured text. Specifically, we seek to demonstrate how two quantitative analytic methods, viz., Latent Semantic Analysis and data mining, can aid researchers in revealing core content topic areas in large (or small) data sets, and in visualizing how these concepts evolve, migrate, converge or diverge over time. We exemplify the complementary application of these techniques through an examination of a 25-year sample of abstracts from selected journals in Information Systems, Management, and Accounting disciplines. Through this work, we explore the capabilities of two computational techniques, and show how these techniques can be used to gather insights from a large corpus of unstructured text.

Original languageEnglish
Pages (from-to)49-69
Number of pages21
JournalEuropean Journal of Information Systems
Volume21
Issue number1
DOIs
Publication statusPublished - Jan 2012

Fingerprint

content analysis
Information systems
Blogs
Electronic mail
Data mining
Semantics
information system
Chemical analysis
systems research
weblog
coding
semantics
examination
interpretation
trend
interview
management

Cite this

@article{b3fb7fa580b24dd1815d1b400c3fa204,
title = "Quantitative approaches to content analysis: Identifying conceptual drift across publication outlets",
abstract = "Unstructured text data, such as emails, blogs, contracts, academic publications, organizational documents, transcribed interviews, and even tweets, are important sources of data in Information Systems research. Various forms of qualitative analysis of the content of these data exist and have revealed important insights. Yet, to date, these analyses have been hampered by limitations of human coding of large data sets, and by bias due to human interpretation. In this paper, we compare and combine two quantitative analysis techniques to demonstrate the capabilities of computational analysis for content analysis of unstructured text. Specifically, we seek to demonstrate how two quantitative analytic methods, viz., Latent Semantic Analysis and data mining, can aid researchers in revealing core content topic areas in large (or small) data sets, and in visualizing how these concepts evolve, migrate, converge or diverge over time. We exemplify the complementary application of these techniques through an examination of a 25-year sample of abstracts from selected journals in Information Systems, Management, and Accounting disciplines. Through this work, we explore the capabilities of two computational techniques, and show how these techniques can be used to gather insights from a large corpus of unstructured text.",
author = "Marta Indulska and Hovorka, {Dirk S.} and Jan Recker",
year = "2012",
month = "1",
doi = "10.1057/ejis.2011.37",
language = "English",
volume = "21",
pages = "49--69",
journal = "Journal of applied systems analysis",
issn = "0960-085X",
publisher = "Palgrave Macmillan",
number = "1",

}

Quantitative approaches to content analysis : Identifying conceptual drift across publication outlets. / Indulska, Marta; Hovorka, Dirk S.; Recker, Jan.

In: European Journal of Information Systems, Vol. 21, No. 1, 01.2012, p. 49-69.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - Quantitative approaches to content analysis

T2 - Identifying conceptual drift across publication outlets

AU - Indulska, Marta

AU - Hovorka, Dirk S.

AU - Recker, Jan

PY - 2012/1

Y1 - 2012/1

N2 - Unstructured text data, such as emails, blogs, contracts, academic publications, organizational documents, transcribed interviews, and even tweets, are important sources of data in Information Systems research. Various forms of qualitative analysis of the content of these data exist and have revealed important insights. Yet, to date, these analyses have been hampered by limitations of human coding of large data sets, and by bias due to human interpretation. In this paper, we compare and combine two quantitative analysis techniques to demonstrate the capabilities of computational analysis for content analysis of unstructured text. Specifically, we seek to demonstrate how two quantitative analytic methods, viz., Latent Semantic Analysis and data mining, can aid researchers in revealing core content topic areas in large (or small) data sets, and in visualizing how these concepts evolve, migrate, converge or diverge over time. We exemplify the complementary application of these techniques through an examination of a 25-year sample of abstracts from selected journals in Information Systems, Management, and Accounting disciplines. Through this work, we explore the capabilities of two computational techniques, and show how these techniques can be used to gather insights from a large corpus of unstructured text.

AB - Unstructured text data, such as emails, blogs, contracts, academic publications, organizational documents, transcribed interviews, and even tweets, are important sources of data in Information Systems research. Various forms of qualitative analysis of the content of these data exist and have revealed important insights. Yet, to date, these analyses have been hampered by limitations of human coding of large data sets, and by bias due to human interpretation. In this paper, we compare and combine two quantitative analysis techniques to demonstrate the capabilities of computational analysis for content analysis of unstructured text. Specifically, we seek to demonstrate how two quantitative analytic methods, viz., Latent Semantic Analysis and data mining, can aid researchers in revealing core content topic areas in large (or small) data sets, and in visualizing how these concepts evolve, migrate, converge or diverge over time. We exemplify the complementary application of these techniques through an examination of a 25-year sample of abstracts from selected journals in Information Systems, Management, and Accounting disciplines. Through this work, we explore the capabilities of two computational techniques, and show how these techniques can be used to gather insights from a large corpus of unstructured text.

UR - http://www.scopus.com/inward/record.url?scp=84856994229&partnerID=8YFLogxK

U2 - 10.1057/ejis.2011.37

DO - 10.1057/ejis.2011.37

M3 - Article

VL - 21

SP - 49

EP - 69

JO - Journal of applied systems analysis

JF - Journal of applied systems analysis

SN - 0960-085X

IS - 1

ER -