Deep Multi-view Stereo for GTSFM

Liu, Ren

Title:

Deep Multi-view Stereo for GTSFM

dc.contributor.advisor	Dellaert, Frank
dc.contributor.author	Liu, Ren
dc.contributor.committeeMember	Hays, James
dc.contributor.committeeMember	Pradalier, Cedric
dc.contributor.department	Computer Science
dc.date.accessioned	2022-05-18T19:35:36Z
dc.date.available	2022-05-18T19:35:36Z
dc.date.created	2022-05
dc.date.issued	2022-05-03
dc.date.submitted	May 2022
dc.date.updated	2022-05-18T19:35:36Z
dc.description.abstract	Current structure-from-motion (SFM) pipelines integrated with multi-view stereo (MVS) module often applies traditional MVS algorithms. These algorithms cannot be well parallelized and can be slow when the data size and the resolution increase. For view synthesis, there are seldom SFM pipelines integrating it. This thesis focuses on how to integrate MVS and view synthesis efficiently into SFM pipelines, especially for the latest deep learning approaches. I first do a thorough survey in both domains, compare the advantages and disadvantages of the latest studies, and select the best-fit approach for our distributed SFM pipeline, Georgia Tech Structure from Motion (GTSFM). We implement the deep multi-view optimizer with PatchmatchNet and integrate it into the working graph of GTSFM. We also design an algorithm to boost a novel deep view synthesis algorithm, Instant-NGP, by forcing the reconstruction region on the overlapping Field-of-Views. This also enables us to extract high-quality dense polygon meshes of foreground objects directly from the reconstructed depth field. Experiments run on DTU and Skydio Crane Mast datasets suggest our MVS approach is more efficient than some popular SFM pipelines with MVS implemented.
dc.description.degree	M.S.
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/1853/66613
dc.language.iso	en_US
dc.publisher	Georgia Institute of Technology
dc.subject	Multi-view stereo
dc.subject	View synthesis
dc.subject	Structure-from-motion
dc.subject	Deep learning
dc.title	Deep Multi-view Stereo for GTSFM
dc.type	Text
dc.type.genre	Thesis
dspace.entity.type	Publication
local.contributor.advisor	Dellaert, Frank
local.contributor.corporatename	College of Computing
relation.isAdvisorOfPublication	dac80074-d9d8-4358-b6eb-397d95bdc868
relation.isOrgUnitOfPublication	c8892b3c-8db6-4b7b-a33a-1b67f7db2021
thesis.degree.level	Masters