Scene Synthesis with Automated Generation of Textual Descriptions

Loading...
Thumbnail Image
Date
2022
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association
Abstract
Most current research on automatically captioning and describing scenes with spatial content focuses on images. We outline that generating descriptive text for a synthesized 3D scene can be achieved via a suitable intermediate representation employed in the synthesis algorithm. As an example, we synthesize scenes of medieval village settings, and generate their descriptions. Our system employs graph grammars, Markov Chain Monte Carlo optimization, and a natural language generation pipeline. Randomly placed objects are evaluated and optimized by a cost function capturing neighborhood relations, path layouts, and collisions. Further, in a pilot study we assess the performance of our framework by comparing the generated descriptions to others provided by human subjects. While the latter were often short and low-effort, the highest-rated ones clearly outperform our generated ones. Nevertheless, the average of all collected human descriptions was indeed rated by the study participants as being less accurate than the automated ones.
Description

CCS Concepts: Computing methodologies --> Computer graphics; Natural language generation

        
@inproceedings{
10.2312:egs.20221026
, booktitle = {
Eurographics 2022 - Short Papers
}, editor = {
Pelechano, Nuria
 and
Vanderhaeghe, David
}, title = {{
Scene Synthesis with Automated Generation of Textual Descriptions
}}, author = {
Müller-Huschke, Julian
 and
Ritter, Marcel
 and
Harders, Matthias
}, year = {
2022
}, publisher = {
The Eurographics Association
}, ISSN = {
1017-4656
}, ISBN = {
978-3-03868-169-4
}, DOI = {
10.2312/egs.20221026
} }
Citation