Program

Image courtesy of Kirk Goldsberry

VDS @IEEE VIS 2024

Registration at https://ieeevis.org/year/2024/info/registration/conference-registration .

Sun. Oct 13, 2024, 12:00 PM - 3:00 PM EDT (UTC-4)

12:00pm - 12:05pm

Opening

12:05pm - 12:45pm

Keynote: Marcus Beck

Marcus Beck

12:05pm~12:45pm EDT (UTC-4)

Keynote: Bringing data visualization to Tampa Bay - Modernizing the past and supporting the future

Dr. Marcus W Beck, Tampa Bay Estuary Program

Abstract: The Tampa Bay Estuary Program (TBEP) has provided over 30 years of support to manage one of Florida’s most precious environmental resources. As one of 28 similar programs in the United States, the TBEP works to achieve consensus on the most scientifically-robust approaches to inform environmental management for Tampa Bay. The TBEP has embraced open science visualization practices as one of its core strategic priorities to improve how research applications bridge the gap between the scientific community and decision makers. These efforts have included modernizing historical reporting workflows and developing novel bay health indicators with robust visualization techniques in a cloud-based environment. In addition to supporting a healthy Tampa Bay, these visualization workflows are documented in a reproducible framework to support future management efforts and to avoid past mistakes of information loss as the program evolves. Much is still needed from the data visualization community to understand how to leverage cutting-edge techniques for the benefit of Tampa Bay and beyond.

Bio: Marcus Beck is the Senior Scientist for the Tampa Bay Estuary Program in St. Petersburg, Florida and is developing data analysis and visualization methods for Bay health indicators. Marcus has experience researching environmental indicators and developing open science products to support decision-making in aquatic environments around the country, including Minnesota lakes, Florida estuaries, and California streams. He has been using the R statistical programming language for over 15 years and has taught several workshops on its application to environmental sciences. Marcus has also developed several R packages and currently maintains 9 on CRAN. He received a PhD in Conservation Biology with a minor in Statistics from the University of Minnesota in 2013, his Masters in Conservation Biology from the University of Minnesota in 2009, and his Bachelors in Zoology from the University of Florida in 2007.

12:45pm - 1:15pm

Paper Session 1

12:45pm - 12:55pm

[Best Paper] Visualization and Automation in Data Science: Exploring the Paradox of Humans-in-the-Loop

Jen Rogers, Emily Wall, Mehdi Chakhchoukh, Marie Anastacio, Rebecca Faust, Cagatay Turkay, Lars Kotthoff, Steffen Koch, Andreas Kerren, Jürgen Bernard

Abstract: This position paper explores the interplay between automation and human involvement in data science. It synthesizes perspectives from Automated Data Science (AutoDS) and Interactive Data Visualization (VIS), which traditionally represent opposing ends of the human-machine spectrum. While AutoDS aims to enhance efficiency by reducing human tasks, VIS emphasizes the importance of nuanced understanding, innovation, and context provided by human involvement. This paper examines these dichotomies through an online survey and advocates for a balanced approach that harmonizes the efficiency of automation with the irreplaceable insights of human expertise. Ultimately, we address the essential question of not just what we can automate, but what we should automate, seeking strategies that prioritize technological advancement alongside the fundamental need for human oversight.

12:55pm - 1:05pm

Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent

Yannick Metz, Dennis Ackermann, Daniel Keim, Maximilian T. Fischer

Abstract: Efficient public transport systems are crucial for sustainable urban development as cities face increasing mobility demands. Yet, many public transport networks struggle to meet diverse user needs due to historical development, urban constraints, and financial limitations. Traditionally, planning of transport network structure is often based on limited surveys, expert opinions, or partial usage statistics. This provides an incomplete basis for decision-making. We introduce an data-driven approach to public transport planning and optimization, calculating detailed accessibility measures at the individual housing level. Our visual analytics workflow combines population-group-based simulations with dynamic infrastructure analysis, utilizing a scenario-based model to simulate daily travel patterns of varied demographic groups, including schoolchildren, students, workers, and pensioners. These population groups, each with unique mobility requirements and routines, interact with the transport system under different scenarios traveling to and from Points of Interest (POI), assessed through travel time calculations. Results are visualized through heatmaps, density maps, and network overlays, as well as detailed statistics. Our system allows us to analyze both the underlying data and simulation results on multiple levels of granularity, delivering both broad insights and granular details. Case studies with the city of Konstanz, Germany reveal key areas where public transport does not meet specific needs, confirmed through a formative user study. Due to the high cost of changing legacy networks, our analysis facilitates the identification of strategic enhancements, such as optimized schedules or rerouting, and few targeted stop relocations, highlighting consequential variations in accessibility to pinpointing critical service gaps. Our research advances urban transport analytics by providing policymakers and citizens with a system that delivers both broad insights with granular detail into public transport services for a data-driven quality assessment at housing-level detail.

1:05pm - 1:15pm

Towards a Visual Perception-Based Analysis of Clustering Quality Metrics

Graziano Blasilli, Daniel Kerrigan, Enrico Bertini, Giuseppe Santucci

Abstract: Clustering is an essential technique across various domains, such as data science, machine learning, and eXplainable Artificial Intelligence. Information visualization and visual analytics techniques have been proven to effectively support human involvement in the visual exploration of clustered data to enhance the understanding and refinement of cluster assignments. This paper presents an attempt of a deep and exhaustive evaluation of the perceptive aspects of clustering quality metrics, focusing on the Davies-Bouldin Index, Dunn Index, Calinski-Harabasz Index, and Silhouette Score. Our research is centered around two main objectives: a) assessing the human perception of common CVIs in 2D scatterplots and b) exploring the potential of Large Language Models (LLMs), in particular GPT-4o, to emulate the assessed human perception. By discussing the obtained results, highlighting limitations, and areas for further exploration, this paper aims to propose a foundation for future research activities.

1:15pm - 1:45pm

Break

1:45pm - 2:15pm

Paper Session 2

1:45pm - 1:55pm

The Categorical Data Map: A Multidimensional Scaling-Based Approach

Frederik L. Dennig, Lucas Joos, Patrick Paetzold, Daniela Blumberg, Oliver Deussen, Daniel Keim, Maximilian T. Fischer

Abstract: Categorical data does not have an intrinsic definition of distance or order, and therefore, established visualization techniques for categorical data only allow for a set-based or frequency-based analysis, e.g., through Euler diagrams or Parallel Sets, and do not support a similarity-based analysis. We present a novel dimensionality reduction-based visualization for categorical data, which is based on defining the distance of two data items as the number of varying attributes. Our technique enables users to pre-attentively detect groups of similar data items and observe the properties of the projection, such as attributes strongly influencing the embedding. Our prototype visually encodes data properties in an enhanced scatterplot-like visualization, visualizing attributes in the background to show the distribution of categories. In addition, we propose two graph-based measures to quantify the plot's visual quality, which rank attributes according to their contribution to cluster cohesion. To demonstrate the capabilities of our similarity-based projection method, we compare it to Euler diagrams and Parallel Sets regarding visual scalability and evaluate it quantitatively on seven real-world datasets using a range of common quality measures. Further, we validate the benefits of our approach through an expert study with five data scientists analyzing the Titanic and Mushroom dataset with up to 23 attributes and 8124 category combinations. Our results indicate that our Categorical Data Map offers an effective analysis method for large datasets with a high number of category combinations.

1:55pm - 2:05pm

Interactive Counterfactual Exploration of Algorithmic Harms in Recommender Systems

Yongsu Ahn, Quinn K Wolter, Jonilyn Dick, Janet Dick, Yu-Ru Lin

Abstract: Recommender systems have become integral to digital experiences, shaping user interactions and preferences across various platforms. Despite their widespread use, these systems often suffer from algorithmic biases that can lead to unfair and unsatisfactory user experiences. This study introduces an interactive tool designed to help users comprehend and explore the impacts of algorithmic harms in recommender systems. By leveraging visualizations, counterfactual explanations, and interactive modules, the tool allows users to investigate how biases such as miscalibration, stereotypes, and filter bubbles affect their recommendations. Informed by in-depth user interviews, both general users and researchers can benefit from increased transparency and personalized impact assessments, ultimately fostering a better understanding of algorithmic biases and contributing to more equitable recommendation outcomes. This work provides valuable insights for future research and practical applications in mitigating bias and enhancing fairness in machine learning algorithms.

2:05pm - 2:15pm

Seeing the Shift: Keep an Eye on Semantic Changes in Times of LLMs

Raphael Buchmüller, Friederike Körte, Daniel Keim

Abstract: This position paper discusses the profound impact of Large Language Models (LLMs) on semantic change, emphasizing the need for comprehensive monitoring and visualization techniques. Building on established concepts from linguistics, we examine the interdependency between mental and language models, discussing how LLMs influence and are influenced by human cognition and societal context. We introduce three primary theories to conceptualize such influences: Recontextualization, Standardization, and Semantic Dementia, illustrating how LLMs drive, standardize, and potentially degrade language semantics. Our subsequent review categorizes methods for visualizing semantic change into frequency-based, embedding-based, and context-based techniques, being first in assessing their effectiveness in capturing linguistic evolution: Embedding-based methods are highlighted as crucial for a detailed semantic analysis, reflecting both broad trends and specific linguistic changes. We underscore the need for novel visual, interactive tools to monitor and explain semantic changes induced by LLMs, ensuring the preservation of linguistic diversity and mitigating linguistic biases. This work provides essential insights for future research on semantic change visualization and the dynamic nature of language evolution in the times of LLMs.

2:15pm - 2:57pm

Reflections, Challenges, and Innovations in Visualization for Data Science: A Conversation

Emcee: Anamaria Crisan

VDS at IEEE VIS 2024 will conclude with a panel reflecting on ten years of symposia at the Vis conference. Paper Chair Ana Crisan will discuss the past, present, and future of visualization for data science with former chairs Hendrik Strobelt (VDS 2020), Mark Streit (VDS 2016), and Claudio Silva (VDS 2015).

2:57pm - 3:00pm

Closing

Menu

Important Dates

Supporters

VDS Archive

Program