Title:
Deep Multi-view Stereo for GTSFM
Deep Multi-view Stereo for GTSFM
dc.contributor.advisor | Dellaert, Frank | |
dc.contributor.author | Liu, Ren | |
dc.contributor.committeeMember | Hays, James | |
dc.contributor.committeeMember | Pradalier, Cedric | |
dc.contributor.department | Computer Science | |
dc.date.accessioned | 2022-05-18T19:35:36Z | |
dc.date.available | 2022-05-18T19:35:36Z | |
dc.date.created | 2022-05 | |
dc.date.issued | 2022-05-03 | |
dc.date.submitted | May 2022 | |
dc.date.updated | 2022-05-18T19:35:36Z | |
dc.description.abstract | Current structure-from-motion (SFM) pipelines integrated with multi-view stereo (MVS) module often applies traditional MVS algorithms. These algorithms cannot be well parallelized and can be slow when the data size and the resolution increase. For view synthesis, there are seldom SFM pipelines integrating it. This thesis focuses on how to integrate MVS and view synthesis efficiently into SFM pipelines, especially for the latest deep learning approaches. I first do a thorough survey in both domains, compare the advantages and disadvantages of the latest studies, and select the best-fit approach for our distributed SFM pipeline, Georgia Tech Structure from Motion (GTSFM). We implement the deep multi-view optimizer with PatchmatchNet and integrate it into the working graph of GTSFM. We also design an algorithm to boost a novel deep view synthesis algorithm, Instant-NGP, by forcing the reconstruction region on the overlapping Field-of-Views. This also enables us to extract high-quality dense polygon meshes of foreground objects directly from the reconstructed depth field. Experiments run on DTU and Skydio Crane Mast datasets suggest our MVS approach is more efficient than some popular SFM pipelines with MVS implemented. | |
dc.description.degree | M.S. | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1853/66613 | |
dc.language.iso | en_US | |
dc.publisher | Georgia Institute of Technology | |
dc.subject | Multi-view stereo | |
dc.subject | View synthesis | |
dc.subject | Structure-from-motion | |
dc.subject | Deep learning | |
dc.title | Deep Multi-view Stereo for GTSFM | |
dc.type | Text | |
dc.type.genre | Thesis | |
dspace.entity.type | Publication | |
local.contributor.advisor | Dellaert, Frank | |
local.contributor.corporatename | College of Computing | |
relation.isAdvisorOfPublication | dac80074-d9d8-4358-b6eb-397d95bdc868 | |
relation.isOrgUnitOfPublication | c8892b3c-8db6-4b7b-a33a-1b67f7db2021 | |
thesis.degree.level | Masters |