The 3D content capture process differs substantially with the type of camera. A stereo camera or depth camera captures video together with the associated per-pixel depth or disparity information. Multi-view cameras capture multiple images simultaneously from various angles; a multi-view matching (or correspondence) process is then required to generate a disparity map for each pair of cameras, and the 3D structure is estimated from these disparity maps. The most challenging scenario is capturing 3D content with a normal 2D (or monoscopic) camera, which provides no disparity or depth information. In this chapter, we showcase a real-time 3D capturing system built on a monoscopic mobile phone in Section 4.2.4.
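To make the step from a disparity map to 3D structure concrete, the following is a minimal sketch of the usual triangulation relation for a single rectified camera pair, Z = f·B/d. It assumes an ideal pinhole model, a focal length f in pixels, a baseline B in metres, and a disparity d in pixels; these assumptions, the function name `depth_from_disparity`, and the sample values are illustrative and are not taken from the systems described in this chapter.

```python
def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Convert a disparity map (pixels) to a depth map (metres).

    Assumes a rectified stereo pair and a pinhole camera model, so that
    depth = focal length * baseline / disparity. Pixels with zero or
    negative disparity have no valid match and are mapped to infinity.
    """
    depth_m = []
    for row in disparity_px:
        depth_m.append([
            focal_px * baseline_m / d if d > 0 else float("inf")
            for d in row
        ])
    return depth_m


# Hypothetical 2x2 disparity map; larger disparity means a closer point.
disparity = [[50.0, 25.0],
             [12.5, 0.0]]

depth = depth_from_disparity(disparity, focal_px=800.0, baseline_m=0.0625)
print(depth)
```

With these sample values, the pixel with the largest disparity is the nearest one, and the pixel with no disparity is treated as infinitely far away. A multi-view rig would repeat this conversion for each camera pair before fusing the results.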
A typical stereo camera records with two cameras mounted side by side, although some variants integrate both into a single body with two lenses. We list a few examples below: