Kitware at WACV 2023

Leaders in Artificial Intelligence, Machine Learning, and Computer Vision

WACV is a premier international computer vision conference that attracts vision researchers and practitioners from around the world. Being an academic conference, WACV emphasizes papers on systems and applications with significant, interesting vision components and is highly selective, with fewer than 30% of submissions accepted.

Kitware has supported WACV over the past few years as a sponsor, exhibitor, and presenter. This year, we are a Gold-level sponsor and will have an in-person exhibit space where we will highlight our ongoing research. Visit us to learn how we apply computer vision to solve challenging problems across sea, air, space, terrestrial, and internet domains. We are proud to have three papers accepted and to be co-chairing four workshops at WACV this year (see “Events” section below for more information).

Request a meeting with our team to discuss your project and how we can help you leverage our open source tools.

Computer Vision works in all domains, Space, Air, Land, Sea Surface and Undersea.

Learn more about our computer vision capabilities

Events Schedule

Paper Presentation

Thursday, January 5 from 2:15-3:30 PM GMT

MEVID: Multi-view Extended Videos with Identities for Video Person Re-Identification

Authors: Daniel Davila, Dawei Du, Bryon Lewis, Christopher Funk, Joseph Van Pelt, Roderick Collins, Kellie Corona, Matt Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp (Kitware)

In this paper, we present the Multi-view Extended Videos with Identities (MEVID) dataset for large-scale, video person re-identification (ReID) in the wild. To our knowledge, MEVID represents the most-varied video person ReID dataset, spanning an extensive indoor and outdoor environment across nine unique dates in a 73-day window, various camera viewpoints, and entity clothing changes. While other datasets have more unique identities, MEVID emphasizes a richer set of information about each individual. Being based on the MEVA video dataset, we also inherit data that is intentionally demographically balanced to the continental United States. To accelerate the annotation process, we developed a semi-automatic annotation framework and GUI that combines state-of-the-art, real-time models for object detection, pose estimation, person ReID, and multi-object tracking. Link to paper ->

Track: 4B Tracking & Reidentification

Room: Naupaka VII

Paper Presentation

Friday, January 6 from 3-4 PM GMT

Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition

Authors: Dawei Du (Kitware), Ameya Shringi, Christopher Funk (Kitware), Anthony Hoogs (Kitware)

This paper focuses on action recognition datasets and algorithms that operate within open set problems, where test samples maybe drawn from either known or unknown classes. Existing open set action recognition methods are typically based on extending closed set methods by adding post hoc analysis of classification scores or feature distances and do not capture the relations among all the video clip elements. Our approach uses the reconstruction error to determine the novelty of the video since unknown classes are harder to put back together and, therefore, have a higher reconstruction error than videos from known classes. Our solution is a novel graph-based autoencoder that accounts for contextual and semantic relations among the clip pieces for improved reconstruction. Link to Paper

Track: 8B Action Recognition

Room: Naupaka VII

Paper Presentation

Friday, January 6 from 3:15-5:15 GMT

Handling Image and Label Resolution Mismatch in Remote Sensing

Scott Workman, Armin Hadzic, M. Usman Rafique (Kitware)

There are unique challenges to semantic segmentation in the remote sensing domain. For example, the differences in ground sample distance results in a resolution mismatch between overhead imagery and ground-truth label sources. This paper presents a supervised method using low-resolution labels (without upsampling), that takes advantage of an exemplar set of high-resolution labels to guide the learning process. Our method incorporates region aggregation, adversarial learning, and self-supervised pre-training to generate fine-grained predictions, without requiring high-resolution annotations. Extensive experiments demonstrate the real-world applicability of our approach. Link to paper ->

Track: 9B Remote Sensing, Agriculture and Biology, Embedded & Real-Time, Few-Shot Learning

Room: Naupaka VII

Workshop

Tuesday, January 3, Full Day

Workshop on Maritime Computer Vision

Co-organizer: Matthew Dawkins (Kitware)

Over the past few years, many computer vision applications have emerged in the maritime and freshwater domains. Autonomous vehicles have made accessing maritime environments easier by providing the potential for automation on busy waterways and shipping routes and airborne applications. Computer vision plays an essential role in accurate navigation when operating these vehicles in busy traffic or close to the shores. This workshop aims to bring together Maritime Computer Vision researchers and promote deploying modern computer vision approaches in airborne and surface water domains.

Additional Information

Workshop

Tuesday, January 3, Full Day

2nd Workshop on Dealing with the Novelty in Open Worlds

Co-chairs: Christopher Funk, Ph.D. (Kitware), Dawei Du, Ph.D. (Kitware)

Computer vision algorithms are often developed inside a closed-world paradigm (e.g. recognizing objects from a fixed set of categories). However, the real-world is open, and constantly and dynamically changes. As a result, most computer vision algorithms can’t detect the change and continue to perform their tasks with incorrect and sometimes misleading predictions. Many real-world applications considered at WACV must deal with changing worlds where a variety of novelty is introduced (e.g., new classes of objects). In this workshop, we aim to facilitate research directions that operate well in the open-world while maintaining performance in the closed-world. We will explore mechanisms to measure competence at recognizing and adapting to novelty.

Additional Information

Workshop

Tuesday, January 3, Full Day

3rd Workshop on Real-World Surveillance: Applications and Challenges

Co-chair: Anthony Hoogs, Ph.D. (Kitware)

This workshop will cover topics related to the application of computer vision in real-world surveillance, the challenges associated with this surveillance, and mitigation strategy on topics such as object detection, scene understanding, and super-resolution. The workshop will also address legal and ethical issues of computer vision applications in these real-world scenarios, for example, detecting bias toward gender or race.

Additional Information

Workshop

Saturday, January 7, Full Day

Long-Range Recognition

Co-chair: Scott McCloskey, Ph.D. (Kitware)

Vision-based recognition in uncontrolled environments has been a topic of interest for researchers for decades. The addition of large standoff (lateral and/or vertical) distances between sensing platforms and the objects being sensed adds new challenges to this area. This workshop will cover some of the focused research programs addressing this issue, along with the supporting challenges of data collection, data curation, etc. and highlight architectures for multimodal recognition that seem to offer potential for strong performance. This workshop also aims to develop implicit consensus for current best practices on data-related matters and identify topics that need new focused attention.

Additional Information

Kitware is Hiring

Kitware’s Computer Vision Areas of Expertise

3D Reconstruction, Point Clouds, and Odometry

Computer generated graphics of maps and data

3D Reconstruction, Point Clouds, and Odometry

Kitware’s algorithms can extract 3D point clouds and surface meshes from video or images, without metadata or calibration information, or exploiting it when available. Operating on these 3D datasets or others from LiDAR and other depth sensors, our methods estimate scene semantics and 3D reconstruction jointly to maximize the accuracy of object classification, visual odometry, and 3D shape. Our open source 3D reconstruction toolkit, TeleSculptor, is continuously evolving to incorporate advancements to automatically analyze, visualize, and make measurements from images and video. LiDARView, another open source toolkit developed specifically for LiDAR data, performs 3D point cloud visualization and analysis in order to fuse data, techniques, and algorithms to produce SLAM and other capabilities.

WACV 2023

Leaders in Artificial Intelligence, Machine Learning, and Computer Vision

Learn more about our computer vision capabilities

Events Schedule

Kitware is Hiring

Kitware’s Computer Vision Areas of Expertise

3D Reconstruction, Point Clouds, and Odometry

Complex Activity, Event, and Threat Detection

Cyber-Physical Systems

Dataset Collection and Annotation

Generative AI, LLMs, VLMs

Responsible and Explainable AI

Geospatial Information Systems and Visualization

Multimedia Integrity Assurance

Interactive Artificial Intelligence and Human-Machine Teaming

Object Detection and Classification

Semantic Segmentation

Super Resolution and Enhancement

3D Reconstruction, Point Clouds, and Odometry

Complex Activity, Event, and Threat Detection

Cyber-Physical Systems

Dataset Collection and Annotation

Generative AI, LLMs, VLMs

Responsible and Explainable AI

Geospatial Information Systems and Visualization

Multimedia Integrity Assurance

Interactive Artificial Intelligence and Human-Machine Teaming

Object Detection and Classification

Semantic Segmentation

Super Resolution and Enhancement

Interested in learning more? Let’s talk!