Exploratory Analysis of Spatial Data and Decision Making
Using Interactive Maps and Linked Dynamic Displays
Tutorial
Natalia Andrienko and Gennady Andrienko
 |
Fraunhofer AIS – Institute for Autonomous Intelligent Systems
Schloss Birlinghoven, Sankt-Augustin, D-53754 Germany
http://www.ais.fraunhofer.de/and
e-mail: {Natalia.Andrienko, Gennady.Andrienko}@ais.fraunhofer.de
|
Abstract
The importance of exploratory data analysis (EDA) as a prerequisite to
application of computational methods, such as traditional statistical analysis,
is currently widely recognised. The goal of EDA is to gain understanding
of data, i.e. to penetrate into relationships, patterns, and trends hidden
inside data and to formulate hypotheses that can later be checked using
statistical methods. Preliminary investigation of data must also precede
their preparation to processing by various computation-based analysis tools,
such as data mining.
Techniques of EDA are mostly based on data visualisation, i.e.
the graphical presentation of data in ways that prompt the discovery of
important traits and relationships. Computers enabled features of graphical
presentations that are now considered indispensable for EDA: high user
interactivity, allowance for various transformations, and multiple dynamically
linked views such that changes in one display are immediately propagated
to all others. An important category of data dealt with in statistics is
spatially
referenced data. For visualization of such data, maps are traditionally
used, since they are isomorphic to space and thus capable of representing
and conveying to human’s eye significant spatial relationships. High degree
of user interactivity is a general requirement to map displays intended
to support “spatial thinking”, i.e. hypothesis generation, data analysis,
and decision making. Examples of possible user interactions include:
-
Dynamic linking of maps with other types of graphical displays by so-called
“brushing”: objects selected in one of the displays are simultaneously
highlighted in all of them;
-
tools for changing in real time parameters of conventional cartographic
methods such as the classes in a choropleth map;
-
various interactive devices for controlling map animation
Still, interactive techniques and tools can support information
exploration and knowledge construction only when users are able to properly
utilise these instruments. User studies demonstrate that effective use
of the novel techniques requires learning of the new concepts and ideas.
Users are able to understand and adopt the new ideas concerning map interactivity
and manipulability. However, these ideas needed to be appropriately introduced;
people could not grasp them just from the appearance of the maps and controls.
Our Spatial Decision Support Team (SPADE) at the Fraunhofer Institute for
Autonomous Intelligent Systems designs and implements novel visualisation
techniques to support exploratory data analysis and decision making in
a spatial context. In particular, we have analysed traditional methods
of graphical and cartographical data representation, revealed their strong
and weak sides, and found ways to enhance their strengths and compensate
for weaknesses by adding interactivity and dynamics. We combine cartographic
visualisation with other methods of graphical data representation and data
analysis methods from other disciplines, such as statistics and data mining.
To support multi-criteria decision making, we combine established techniques
for multi-criteria decision support with interactive maps and graphs and
invent our own methods, highly interactive and visual. We suggest a range
of techniques for decision support accommodating various styles of decision-making.
The general topic of the proposed tutorial is visualisation of spatial
data as a tool for exploratory data analysis, problem solving, and decision-making.
The tutorial will be based on the CommonGIS system developed by the SPADE
team. The system is available for free use for research and educational
purposes. The participants will receive CD-ROMs containing fully functional
CommonGIS system and a variety of demonstration projects.
Most of the tools considered in the tutorial are unique for CommonGIS
since this system have been specially designed for the most efficient support
of EDA and decision-making. However, the experience gained in the tutorial
can be also utilised in work with commercially available software. In many
cases, operations that are effectively “packed” in CommonGIS into a single
tool may be done in other packages through sequences of data transformation,
calculation, and visualization operations. Some recommendations concerning
data exploration and decision making with the use of available commercial
software will be given.
The tutorial will be held in 1,5 hours and consists of a lecture combined
with demonstration. Depending on the availability of equipment, the participants
can be given an opportunity for the individual work with the demonstrated
tools. For better organisation of the tutorial, it is desirable to supply
participants with computers, at least 1 PC per 2 participants. The use
of private laptops is also possible.
Expected audience
The tutorial is intended for the following audience:
-
People who are performing analysis of spatially- and temporally-related
data as a part of their professional activities;
-
Decision makers who are interested in making informed decisions;
-
Researchers interested in novel techniques of interactive graphical data
analysis.
Only basic knowledge in using computers and data processing software is
expected. No background in GIS, spatial data processing, and statistics
is required.
Tutorial structure
-
Introduction to EDA. Cartographical visualisation of spatially referenced
data. Major techniques for thematic mapping. Interactive vs. traditional
(static) maps.
-
Interactive map manipulation techniques for visual data analysis:
-
choropleth maps with tools for focusing, visual comparison, and pattern
investigation;
-
dynamic classification and cross-classification maps;
-
bar charts and pie charts with tools for visual comparison and focusing.
-
Dynamic linking of maps and other graphical displays
-
simultaneous highlighting of corresponding objects;
-
dynamic query;
-
propagation of classes.
-
Data analysis in space and time
-
map animation; analytical interactive tools for animated maps;
-
time plots and their transformations;
-
space-time cube.
-
Decision making
-
different styles of multiple criteria decision making;
-
ideal point method;
-
dynamic utility symbols;
-
generalised Pareto set, combination with dynamic query
Illustrations: some interactive maps and plots
Figure 1. Screenshot of usage of dynamically linked displays
in a problem-solving session
Figure 2. Effect of outlier removal (focusing). Due to a very
high value of the birth rate in Albania all other countries in the map
on the left seem to be coloured almost uniformly. When Albania is taken
out of consideration (“removed”), differences between the other countries
can be represented more distinctly. The operation is performed using interactive
focussing control; see its appearance left and right to the map for initial
and final states.
Figure 3. Visual comparison technique. On the left, the birth
rate in Germany is chosen as the reference value. Countries with higher
values are painted in shades of brown, and countries with lower values
in shades of blue. On the right, the reference value has been changed to
equal the birth rate in France. In CommonGIS this is actually achieved
just by a mouse click on France. Alternatively, user can click on a colour
band (shown left and right to the map) to set the reference value.
Instructors:
Dr. Natalia Andrienko and Dr. Gennady Andrienko are managers of the SPADE
team on research and technology, respectively. They received their Master
degrees in Computer Science from Kiev State University in 1985 and 1986,
and a Ph.D. equivalent in Computer Science from Moscow State University
in 1992 and 1993, respectively. They worked on knowledge-based systems
at the Mathematics Institute of Moldavian Academy of Sciences (Kishinev,
Moldova), then at the Institute on Mathematical Problems of Biology of
Russian Academy of Science (Pushchino Research Center, Russia). Since 1997
Drs. Andrienko have research positions at GMD (now Fraunhofer Institute
AiS). They play key roles in numerous international R&D projects, including
EU-funded projects CommonGIS, SPIN!, and GIMMI. They are authors of more
than 100 refereed papers that have been published in international journals,
books, and conference proceedings. They are active members of several commissions
of the International Cartographic Association (Commission on Visualization
and Virtual Reality, Commission on Maps and the Internet, and Commission
on Theoretical Cartography). Their research interests and experiences are
interactive computer graphics, automated knowledge-based cartographic visualisation,
information visualisation, spatial data mining, and visual geo-data exploration.
For a list of publications, see http://www.ais.fraunhofer.de/and/publications.html
Appendix I
Survey of CommonGIS history and functionality
Apendix II
Selected publications