Describing Complex Charts in Natural Language: A Caption Generation System - Robotics Institute Carnegie Mellon University

Describing Complex Charts in Natural Language: A Caption Generation System

Vibbhu Mittal, Johanna Moore, Giuseppe Carenini, and Steven F. Roth
Journal Article, Computational Linguistics: Special Issue on Natural Language Generation, Vol. 24, No. 3, pp. 431 - 467, September, 1998

Abstract

Graphical presentations can be used to communicate information in relational data sets succinctly and effectively. However, novel graphical presentations that represent many attributes and relationships are often difficult to understand completely until explained. Automatically generated graphical presentations must therefore either be limited to generating simple, conventionalized graphical presentations, or risk incomprehensibility. A possible solution to this problem would be to extend automatic graphical presentation systems to generate explanatory captions in natural language, to enable users to understand the information expressed in the graphic. This paper presents a system to do so. It uses a text planner to determine the content and structure of the captions based on: (1) a representation of the structure of the graphical presentation and its mapping to the data it depicts, (2) a framework for identifying the perceptual complexity of graphical elements, and (3) the structure of the data expressed in the graphic. The output of the planner is further processed regarding issues such as ordering, aggregation, centering, generating referring expressions and lexical choice. We discuss the architecture of our system and its strengths and limitations. Our implementation is currently limited to 2-D charts and maps, but, except for lexical information, it is completely domain independent. We illustrate our discussion with figures and generated captions about housing sales in Pittsburgh.

BibTeX

@article{Mittal-1998-16588,
author = {Vibbhu Mittal and Johanna Moore and Giuseppe Carenini and Steven F. Roth},
title = {Describing Complex Charts in Natural Language: A Caption Generation System},
journal = {Computational Linguistics: Special Issue on Natural Language Generation},
year = {1998},
month = {September},
volume = {24},
number = {3},
pages = {431 - 467},
}