Loading Events

MSR Speaking Qualifier

May

10
Fri
Anqi Yang Robotics Institute,
Carnegie Mellon University
Friday, May 10
4:00 pm to 5:30 pm
NSH 4305
Anqi Yang – MSR Thesis Talk

Title: 3D Object Detection from CT Scans using a Slice-and-fuse Approach

 

Abstract:

Automatic object detection in 3D X-ray Computed Tomography imagery has recently gained research attention due to its promising applications in aviation baggage screening. The huge dimension of an individual 3D scan, however, poses formidable computational challenges when coupled with deep 3D convolutional networks for inference. In this thesis, we propose the slice-and-fuse strategy — a generic framework to leverage image-based detection and segmentation in high-dimensional 3D volumes. We encode the input 3D volumes into multiple slices along XY, YZ, and XZ directions, exploit 2D CNNs to generate 2D predictions, and then fuse 2D predictions to 3D estimation. Using the proposed strategy, we design two 3D object detectors for 3D baggage CT scans. Retinal-SliceNet uses a unified, single network to detect target objects from the input 3D CT scans. U-SliceNet exploits a two-stage paradigm, first generating proposals using a voxel labeling network and then refining the proposals by a 3D classification network. U-SliceNet generates high-quality segmentation masks along with bounding boxes for target objects. We evaluate the two SliceNets on a large-scale 3D baggage CT dataset for three tasks: baggage classification, 3D object detection, and 3D semantic segmentation.

 

Committee:

Aswin Sankaranarayanan (Co-advisor)

Srinivasa Narasimhan (Co-advisor)

David Held

Jen-Hao Chang