A lot of the underlying machinery is similar: they both rely on "structure from motion" (sfm) techniques to automatically estimate both camera locations and 3d geometry of the scene simultaneously. And both works come from the same lab: the GRAIL group at the University of Washington.
(I postdoced in that lab for 3 years and the authors of this paper are friends/former colleagues.)
More than people realize... Google and apple, for example, use sfm heavily to compute their 3d maps (in Google earth and the apple equivalent). Google also uses it in various other products.
(I postdoced in that lab for 3 years and the authors of this paper are friends/former colleagues.)