An analysis of bar chart usage in corpus data visualization
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
A recent survey of graph usage in corpus-based research has shown that the bar chart is the most widely used graph type for corpus data presentation. Motivated by this finding, the present paper offers a systematic review of bar chart usage in corpus-based research articles. It covers all papers (n = 1,183) published in five corpus-linguistic journals up to and including the year 2024 (International Journal of Corpus Linguistics, Corpus Linguistics and Linguistic Theory, Corpora, Research in Corpus Linguistics, and the International Journal of Learner Corpus Research). The aim of this survey is to arrive at a better understanding of the kinds of visualization tasks imposed on bar charts. We observe that they most commonly show percentages or absolute/normalized frequencies, and that they are often used for relatively complex visualization tasks involving multiple variables and subgroups. The survey is carried out against the backdrop of known limitations of this graph type and design recommendations found in data visualization guidebooks. Our critical examination of diagrams pays attention to issues that compromise the ability of the viewer to accurately perceive patterns in the data, and minor issues that affect the efficiency of a display. These observations are then distilled into a set of concrete recommendations, which are grounded in current usage and the advice given in the data visualization literature.