fbpx
Wikipedia

Tomasi–Kanade factorization

The Tomasi–Kanade factorization is the seminal work by Carlo Tomasi and Takeo Kanade in the early 1990s.[1] It charted out an elegant and simple solution based on a SVD-based factorization scheme for analysing image measurements of a rigid object captured from different views using a weak perspective camera model. The crucial observation made by authors was that if all the measurements (i.e., image co-ordinates of all the points in all the views) are collected in a single matrix, the point trajectories will reside in a certain subspace. The dimension of the subspace in which the image data resides is a direct consequence of two factors:

  1. The type of camera that projects the scene (for example, affine or perspective)
  2. The nature of inspected object (for instance, rigid or non-rigid).

The low-dimensionality of the subspace is mirrored (captured) trivially as reduced rank of the measurement matrix. This reduced rank of measurement matrix can be motivated from the fact that, the position of the projection of an object point on the image plane is constrained as the motion of each point is globally described by a precise geometric model.

Method edit

The rigid-body factorization introduced in provides a description of 3D structure of a rigid object in terms of a set of feature points extracted from salient image features. After tracking the points throughout all the images composing the temporal sequence, a set of trajectories is available. These trajectories are constrained globally at each frame by the rigid transformation which the shape is undergoing, i.e., trajectory of every point will have similar profile.

Let the location of a point j in a frame i be defined as pij = (xij, yij)T where xij and yij are horizontal and vertical image co-ordinates respectively .

A compact representation of the image measurements can be expressed by collecting all the non-homogeneous co-ordinates in a single matrix, called the observation matrix P such that

 

P is a 2F × N matrix, where F is the number of frames and N the number of feature points. Ideally, the observation matrix, should contain perfect information about the object being tracked. Unfortunately, in practice, most state-of-art trackers can only provide point tracks that are incomplete (due to occlusion) and inaccurate (due to sensor noise) if placed in an unstructured environment.

As mentioned earlier, the central premise behind the factorization approach is that a measurement matrix P is rank limited. Further, it is possible to factor P into two sub-matrices: a motion and a shape matrix, M and S of size 2F × r and N × r respectively.

 

The size and structure of S generally depends on the shape properties (for example whether it is rigid or non-rigid) and M depends both on the type of camera model we assume and the shape properties. The essence of factorization method is computing

The optimal r-rank approximation of P with respect to the Frobenius norm can be found out using a SVD-based scheme.

References edit

  1. ^ Carlo Tomasi and Takeo Kanade. (November 1992). "Shape and motion from image streams under orthography: a factorization method". International Journal of Computer Vision. 9 (2): 137–154. CiteSeerX 10.1.1.131.9807. doi:10.1007/BF00129684. S2CID 2931825.

See also edit

tomasi, kanade, factorization, this, article, multiple, issues, please, help, improve, discuss, these, issues, talk, page, learn, when, remove, these, template, messages, this, article, technical, most, readers, understand, please, help, improve, make, underst. This article has multiple issues Please help improve it or discuss these issues on the talk page Learn how and when to remove these template messages This article may be too technical for most readers to understand Please help improve it to make it understandable to non experts without removing the technical details August 2018 Learn how and when to remove this message The topic of this article may not meet Wikipedia s general notability guideline Please help to demonstrate the notability of the topic by citing reliable secondary sources that are independent of the topic and provide significant coverage of it beyond a mere trivial mention If notability cannot be shown the article is likely to be merged redirected or deleted Find sources Tomasi Kanade factorization news newspapers books scholar JSTOR April 2020 Learn how and when to remove this message This article relies largely or entirely on a single source Relevant discussion may be found on the talk page Please help improve this article by introducing citations to additional sources Find sources Tomasi Kanade factorization news newspapers books scholar JSTOR April 2020 This article relies excessively on references to primary sources Please improve this article by adding secondary or tertiary sources Find sources Tomasi Kanade factorization news newspapers books scholar JSTOR April 2020 Learn how and when to remove this message Learn how and when to remove this message The Tomasi Kanade factorization is the seminal work by Carlo Tomasi and Takeo Kanade in the early 1990s 1 It charted out an elegant and simple solution based on a SVD based factorization scheme for analysing image measurements of a rigid object captured from different views using a weak perspective camera model The crucial observation made by authors was that if all the measurements i e image co ordinates of all the points in all the views are collected in a single matrix the point trajectories will reside in a certain subspace The dimension of the subspace in which the image data resides is a direct consequence of two factors The type of camera that projects the scene for example affine or perspective The nature of inspected object for instance rigid or non rigid The low dimensionality of the subspace is mirrored captured trivially as reduced rank of the measurement matrix This reduced rank of measurement matrix can be motivated from the fact that the position of the projection of an object point on the image plane is constrained as the motion of each point is globally described by a precise geometric model Method editThe rigid body factorization introduced in provides a description of 3D structure of a rigid object in terms of a set of feature points extracted from salient image features After tracking the points throughout all the images composing the temporal sequence a set of trajectories is available These trajectories are constrained globally at each frame by the rigid transformation which the shape is undergoing i e trajectory of every point will have similar profile Let the location of a point j in a frame i be defined as pij xij yij T where xij and yij are horizontal and vertical image co ordinates respectively A compact representation of the image measurements can be expressed by collecting all the non homogeneous co ordinates in a single matrix called the observation matrix P such that P x 11 x 1 N x F 1 x F N y 11 y 1 N y F 1 y F N displaystyle mathbf P left begin array ccc x 11 amp cdots amp x 1N vdots amp ddots amp vdots x F1 amp cdots amp x FN y 11 amp cdots amp y 1N vdots amp ddots amp vdots y F1 amp cdots amp y FN end array right nbsp P is a 2F N matrix where F is the number of frames and N the number of feature points Ideally the observation matrix should contain perfect information about the object being tracked Unfortunately in practice most state of art trackers can only provide point tracks that are incomplete due to occlusion and inaccurate due to sensor noise if placed in an unstructured environment As mentioned earlier the central premise behind the factorization approach is that a measurement matrix P is rank limited Further it is possible to factor P into two sub matrices a motion and a shape matrix M and S of size 2F r and N r respectively P M S T displaystyle mathbf P mathbf M mathbf S T nbsp The size and structure of S generally depends on the shape properties for example whether it is rigid or non rigid and M depends both on the type of camera model we assume and the shape properties The essence of factorization method is computingThe optimal r rank approximation of P with respect to the Frobenius norm can be found out using a SVD based scheme References edit Carlo Tomasi and Takeo Kanade November 1992 Shape and motion from image streams under orthography a factorization method International Journal of Computer Vision 9 2 137 154 CiteSeerX 10 1 1 131 9807 doi 10 1007 BF00129684 S2CID 2931825 See also editStructure from motion Retrieved from https en wikipedia org w index php title Tomasi Kanade factorization amp oldid 983927759, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.