Sensors (Basel, Switzerland). 2021 Dec 28; 22(1):197. doi: 10.3390/s22010197

Dynamic Point Cloud Compression Based on Projections, Surface Reconstruction and Video Compression

Emil Dumic 1,*, Anamaria Bjelopera 2, Andreas Nüchter 3
Editor: Steve Vanlanduit
PMCID: PMC8749693  PMID: 35009738

Abstract

In this paper we present a new dynamic point cloud compression scheme based on different projection types and bit depths, combined with a surface reconstruction algorithm and video compression of the obtained geometry and texture maps. Texture maps are compressed after creating Voronoi diagrams. The video compression is specific to geometry (FFV1) and texture (H.265/HEVC). Decompressed point clouds are reconstructed using a Poisson surface reconstruction algorithm. Comparison with the original point clouds was performed using point-to-point and point-to-plane measures. Comprehensive experiments show better performance for some projection maps: the cylindrical, Miller and Mercator projections.

Keywords: 3DTK toolkit, map projections, point cloud compression, point-to-point measure, point-to-plane measure, Poisson surface reconstruction, octree

1. Introduction

A point cloud represents a set of discrete points in a given coordinate system, usually a three-dimensional Cartesian system. It can represent different objects, urban settings and landscapes, or any other physical entities in different use cases, such as: computer graphics and gaming, virtual reality, 3D content creation, medical applications, construction and manufacturing, consumer and retail, cultural heritage, remote sensing, autonomous vehicles, surveillance, etc. [1]. Point clouds, together with light fields and digital holography, are described as plenoptic representations of the visual scene that are used in many immersive applications [2]. Point cloud processing algorithms can be considered in different scenarios, such as acquisition [3,4], coding and transmission [5,6] and (re)presentation and display [7,8].

Due to the data size of static and dynamic point clouds, it is important to define efficient compression techniques for their storage and transmission. In this paper, we present a dynamic point cloud compression scheme that can be used in a point cloud transmission system. A basic overview has been presented in [9], while in [10] several static point clouds (with different sizes, i.e., numbers of points) were compressed and decompressed using the equirectangular projection, with a similar number of points as in the other tested octree-based compression. In this paper we present a comprehensive comparison of 10 different projection types, combined with video compression algorithms for geometry and texture, and finally test surface reconstruction algorithms to fill the holes present after the lossy point cloud decompression process.

The structure of this article is as follows. Section 2 presents a related overview of point cloud compression algorithms. Section 3 presents the different projection types that are used for projecting a 3D point cloud to an image and vice versa. Section 4 describes the creation of a panorama image from a point cloud and how to recreate a point cloud from a panorama image. Section 5 gives the results for the different projection types described in the previous section, using different objective quality measures, and finally Section 6 gives the conclusions.

2. Related Work

Several static and dynamic point cloud coding solutions have been proposed recently. In [11] the authors describe an efficient octree implementation that is used to store, compress or visualize large point clouds. Storing is done without loss of precision, while reducing the size of a point cloud to half the original size. Also, 3D scan matching has been tested using the proposed method, by employing the octree in the nearest neighbor search (NNS) algorithm. The Random Sample Consensus (RANSAC) algorithm is also sped up by using the octree data structure. In the related work [12], shape registration algorithms have been compared using different NNS strategies, with the octree-based method being one of them. The octree implementation (with arbitrarily chosen octree depth) is given in “3DTK—The 3D Toolkit” [13].

Different from octree-based compression, in [14] the authors presented a point cloud compression algorithm based on projections. Different panorama generation methods have been tested, using several projection types: equirectangular, cylindrical, Mercator, rectilinear, Pannini, stereographic and Albers equal-area conic projections. It is shown that the reduced point clouds are useful for feature-based registration on panorama images. In [15] a point cloud compression scheme is proposed using panorama-generated images and an equirectangular projection. Range, color and reflectance information is encoded for each point, by using 24 bits for range (storing it in an RGB image), 24 bits for color and 8 bits for reflectance. Also, lossless and lossy compression methods have been tested on the obtained panorama images. In the case of lossy JPEG compression, an additional method is needed to remove artefacts from the decompressed point cloud.

Similarly, MPEG recently proposed two new point cloud compression codecs, named the G-PCC (Geometry based Point Cloud Compression) codec [16] and the V-PCC (Video based Point Cloud Compression) codec [17]. G-PCC was created by merging two previously defined codecs, L-PCC (LIDAR point cloud compression, for dynamic point clouds) and S-PCC (Surface point cloud compression, for static point clouds). Currently, G-PCC supports only intra prediction, so it does not use temporal redundancies. G-PCC compresses point clouds directly in 3D space. Currently, its lossless mode provides up to a 10:1 compression ratio, while in lossy mode it is possible to obtain up to a 35:1 compression ratio with acceptable quality. In MPEG’s V-PCC codec, the basic idea is to project the point cloud from 3D to 2D, so that the 2D projections can be encoded using existing 2D video encoders such as H.265/HEVC video compression [18]. The current V-PCC encoder compresses dynamic point clouds with a compression ratio of up to 125:1 with acceptable quality. More details about G-PCC and V-PCC are given in [19,20]. A comprehensive overview of G-PCC and V-PCC rate-distortion coding performance can also be found in [21]. A quality evaluation study of the G-PCC and V-PCC codecs is presented in [22], showing the superior compression performance of MPEG V-PCC compared to MPEG G-PCC for the selected static contents.

Neural network based point cloud compression schemes have also been proposed recently. In [23], the authors present point cloud compression using neural networks for separate and joint compression of geometry and texture. Better results are obtained for geometry, and competitive results for color coding at low bitrates, compared with CWI-PCL, the MPEG anchor codec presented in [24]. In [25], the authors presented a new method for static point cloud geometry compression, based on a learned convolutional transform and uniform quantization. Compared to the MPEG reference software, the proposed algorithm achieves on average 51.5% BD-Rate (Bjøntegaard Delta Rate) savings on the Microsoft Upper Body dataset [26]. An updated algorithm is presented in [27]. In [28] a learned point cloud geometry compression is proposed, utilizing deep neural network-based variational autoencoders. The proposed algorithm shows higher compression efficiency than MPEG’s G-PCC, with at least 60% BD-Rate savings, tested on several datasets. An updated algorithm is proposed in [29]. Similar to [27], the authors in [30] present a deep-learning coding approach to static point cloud geometry coding, where a voxelized input point cloud is divided into 3D coding blocks of a fixed size and only non-empty 3D blocks are coded. The encoder transforms the input data into latent representations with lower dimensionality, forcing the network to extract important features. The autoencoder learns the transform and its inverse operation suitable for the target data, in contrast to image-transform coding, where the transform basis functions are fixed. Performance results show improvements over the PCL (Point Cloud Library) [31]. The compression proposed in [30] is improved in [32] by adding a variational autoencoder which captures structure information still present in the latent features; therefore, the entropy coding model parameters can be estimated on the encoder side and replicated on the decoder side more accurately. More importantly, the authors add resolution scalability via interlaced sub-sampling, which not only increases the number of decoded points but also gives a good point cloud quality from a subjective point of view. Furthermore, the same authors proposed adaptive deep learning-based static point cloud geometry coding, which can adapt to any generic point cloud content to maximize the RD performance [33]. The author in [34] presents an adversarial autoencoding strategy for voxelized point cloud geometry, where the main idea is to code each point cloud element independently and to decode it using a lower-resolution reconstruction as side information. The encoder generates hash bytes, and the decoder combines them with the side information to reconstruct the original block. The reconstructed block can be classified with an adversarial discriminator, which acts as a regularizer for the reconstruction process, thus improving the coding performance. A neural network architecture using a predictive coding module at the decoder stage for bit-rate reduction of geometry-only point clouds is described in [35]. In this approach, a block can be encoded independently or predicted using its neighbors, based on the quality of the reconstructed block and the local topology of the model. As an alternative to the mentioned block-by-block processing approaches, point-based models are used. In [36] the authors create an architecture consisting of a PointNet-based encoder, a uniform quantizer, an entropy estimation block and a nonlinear synthesis transformation module, and in [37] a hierarchical autoencoder with a multiscale loss function is presented. These architectures are insufficient for the processing of large point cloud data. VoxelDNN, which combines the octree and voxel domains, was proposed in [38]. Inference in this lossless compression is slow, since the occupancy probabilities are predicted sequentially, voxel by voxel, while the improved MSVoxelDNN models voxel occupancy and achieves rate savings over G-PCC of up to 17% on average [39]. One of the newest methods is presented in [40]; it brings a solution that can be applied to both static and dynamic point cloud compression. It employs a voxel-context-based entropy model, and for dynamic point cloud compression, temporal dependency is exploited. In [41], a detachable learning-based residual coding solution is created, where the residual module enhances the decoded model quality at the expense of added bitrate.

3. Projection Types and Their Description

In this section, we describe different projection types and their inverse solutions that are later used in the process of obtaining 2D image projections from point clouds. Several of them are adopted from [15,42,43]. For each projection type, example panorama images for the geometry and texture are created using the point cloud “longdress_vox10_1060.ply” from the Longdress dynamic point cloud dataset [44].

A point cloud geometry is first transformed from a Cartesian to a spherical coordinate system. Afterwards, the 2D geometry panorama image coordinates (row number x and column number y) are calculated from the angle information (longitude θ and latitude φ), while the intensity (bit depth) of the 2D geometry panorama represents the radius in spherical coordinates. An attribute panorama image, i.e., a 2D image representing color information in our case, is created by using the same row number x and column number y as for the geometry, with the same color as the point cloud point that it represents.
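As an illustrative sketch (our names and conventions, not the actual 3DTK code), the Cartesian to spherical conversion and the mapping of the angles to panorama pixels could look as follows in C++, shown here for the simplest, linear (equirectangular-style) angle-to-pixel mapping:

#include <cmath>

// Convert a Cartesian point to spherical coordinates (theta = longitude,
// phi = latitude, r = radius). Axis conventions and angle ranges are
// assumptions for illustration and may differ from the 3DTK implementation.
struct Spherical { double theta, phi, r; };

Spherical toSpherical(double x, double y, double z) {
    Spherical s;
    s.r = std::sqrt(x * x + y * y + z * z);  // becomes the pixel intensity
    s.theta = std::atan2(y, x);              // longitude in [-pi, pi]
    s.phi = std::asin(z / s.r);              // latitude in [-pi/2, pi/2]; r > 0 assumed
    return s;
}

// Map longitude/latitude to a pixel of a width x height panorama
// (linear mapping; the projections below replace this step).
void toPixel(const Spherical& s, int width, int height, int& row, int& col) {
    col = (int)((s.theta + M_PI) / (2.0 * M_PI) * (width - 1));
    row = (int)((M_PI / 2.0 - s.phi) / M_PI * (height - 1));
}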

3.1. Lambert Azimuthal Equal-Area Projection

Transformation equations and the inverse formulas are given in Equations (1) and (2). φ1 is the standard parallel, while θ0 is the central longitude.

$$x = k\cos\varphi\sin(\theta-\theta_0), \qquad y = k\left(\cos\varphi_1\sin\varphi - \sin\varphi_1\cos\varphi\cos(\theta-\theta_0)\right), \qquad k = \sqrt{\frac{2}{1+\sin\varphi_1\sin\varphi+\cos\varphi_1\cos\varphi\cos(\theta-\theta_0)}}. \tag{1}$$

The inverse formulas are given in Equation (2):

$$\theta = \arctan\frac{x\sin C}{\rho\cos\varphi_1\cos C - y\sin\varphi_1\sin C} + \theta_0, \qquad \varphi = \arcsin\left(\cos C\sin\varphi_1 + \frac{y\sin C\cos\varphi_1}{\rho}\right), \qquad \rho = \sqrt{x^2+y^2}, \qquad C = 2\arcsin\frac{\rho}{2}. \tag{2}$$
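As an illustration of how such a projection pair translates into code, a minimal C++ sketch of Equations (1) and (2) is given below (function names are ours; atan2 is used instead of arctan to resolve the quadrant, and ρ = 0, i.e., the projection center, must be handled separately):

#include <cmath>

// Lambert azimuthal equal-area projection, Equation (1); angles in radians.
void lambertForward(double theta, double phi, double theta0, double phi1,
                    double& x, double& y) {
    double k = std::sqrt(2.0 / (1.0 + std::sin(phi1) * std::sin(phi) +
                                std::cos(phi1) * std::cos(phi) * std::cos(theta - theta0)));
    x = k * std::cos(phi) * std::sin(theta - theta0);
    y = k * (std::cos(phi1) * std::sin(phi) -
             std::sin(phi1) * std::cos(phi) * std::cos(theta - theta0));
}

// Inverse projection, Equation (2); rho = 0 (the center) needs special handling.
void lambertInverse(double x, double y, double theta0, double phi1,
                    double& theta, double& phi) {
    double rho = std::sqrt(x * x + y * y);
    double C = 2.0 * std::asin(rho / 2.0);
    theta = std::atan2(x * std::sin(C),
                       rho * std::cos(phi1) * std::cos(C) -
                       y * std::sin(phi1) * std::sin(C)) + theta0;
    phi = std::asin(std::cos(C) * std::sin(phi1) +
                    y * std::sin(C) * std::cos(phi1) / rho);
}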

An example of panorama images is given in Figure 1.

Figure 1. Panorama images created using point cloud “longdress_vox10_1060.ply” from the Lambert azimuthal equal-area projection, for the geometry (a) and texture (b).

3.2. Albers Equal-Area Conic Projection

Transformation equations and the inverse formulas are given in Equations (3) and (4). Usually, (φ0, θ0) = (0, 0), while φ1 and φ2 represent the minimum and maximum latitudes.

$$x = \rho\sin\left(N(\theta-\theta_0)\right), \qquad y = \rho_0 - \rho\cos\left(N(\theta-\theta_0)\right),$$

where:

$$N = \frac{1}{2}(\sin\varphi_1+\sin\varphi_2), \qquad C = \cos^2\varphi_1 + 2N\sin\varphi_1, \qquad \rho_0 = \frac{\sqrt{C-2N\sin\varphi_0}}{N}, \qquad \rho = \frac{\sqrt{C-2N\sin\varphi}}{N}. \tag{3}$$

The inverse formulas are given in Equation (4):

$$\theta = \theta_0 + \frac{1}{N}\arctan\frac{x}{\rho_0-y}, \qquad \varphi = \arcsin\frac{C-\left(x^2+(\rho_0-y)^2\right)N^2}{2N}. \tag{4}$$

An example of panorama images is given in Figure 2.

Figure 2. Panorama images created using point cloud “longdress_vox10_1060.ply” from the Albers equal-area conic projection, for the geometry (a) and texture (b).

3.3. Cylindrical Projection

The cylindrical projection is similar to the equirectangular projection; however, the vertical coordinate is the tangent of the latitude. Transformation equations and the inverse formulas are given in Equations (5) and (6). Usually, (φ0, θ0) = (0, 0). Later in the experiments, we will “compress” all φ latitude angles before creating panorama images, i.e., multiply φ by 0.825, and “decompress” them (divide by 0.825) after recreating the point cloud. This is because the tangent function is not defined for angles near ±90°.

$$x = \theta-\theta_0, \qquad y = \tan\varphi - \tan\varphi_0. \tag{5}$$

The inverse formulas are given in Equation (6):

$$\theta = x+\theta_0, \qquad \varphi = \arctan(y+\tan\varphi_0). \tag{6}$$
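A minimal C++ sketch of Equations (5) and (6), including the latitude “compression” by 0.825 described above (names are ours; φ0 = 0 is assumed, as in the experiments), could look as follows:

#include <cmath>

const double kLatScale = 0.825;  // latitude "compression" factor from the text

// Cylindrical projection, Equation (5), with compressed latitude.
void cylForward(double theta, double phi, double theta0, double phi0,
                double& x, double& y) {
    phi *= kLatScale;                    // "compress" latitude before projecting
    x = theta - theta0;
    y = std::tan(phi) - std::tan(phi0);
}

// Inverse projection, Equation (6), with latitude "decompression".
void cylInverse(double x, double y, double theta0, double phi0,
                double& theta, double& phi) {
    theta = x + theta0;
    phi = std::atan(y + std::tan(phi0)) / kLatScale;
}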

An example of panorama images is given in Figure 3.

Figure 3. Panorama images created using point cloud “longdress_vox10_1060.ply” from the cylindrical projection, for the geometry (a) and texture (b).

3.4. Cylindrical Equal-Area Projection

Transformation equations and the inverse formulas are given in Equations (7) and (8). θ0 is the standard longitude, while φs is the standard latitude; e.g., for φs = 0 it is called the Lambert cylindrical equal-area projection.

$$x = (\theta-\theta_0)\cos\varphi_s, \qquad y = \frac{\sin\varphi}{\cos\varphi_s}. \tag{7}$$

The inverse formulas are given in Equation (8):

$$\theta = \frac{x}{\cos\varphi_s}+\theta_0, \qquad \varphi = \arcsin(y\cos\varphi_s). \tag{8}$$

An example of panorama images is given in Figure 4.

Figure 4. Panorama images created using point cloud “longdress_vox10_1060.ply” from the cylindrical equal-area projection, for the geometry (a) and texture (b).

3.5. Equidistant Cylindrical Projection

Transformation equations and the inverse formulas for the equidistant cylindrical projection are given in Equations (9) and (10). An equirectangular projection, one of the most common projection types, is a type of equidistant cylindrical projection. The horizontal coordinate x in this projection type is the longitude θ, while the vertical coordinate y is the latitude φ. φ1 represents the standard parallels (north and south of the equator) where the scale of the projection is true. For (φ0, θ0) = (0, 0) and φ1 = 0 (so that cos φ1 = 1) it is called the equirectangular projection, Equation (9).

$$x = (\theta-\theta_0)\cos\varphi_1, \qquad y = \varphi-\varphi_0. \tag{9}$$

The inverse formulas are given in Equation (10):

$$\theta = \frac{x}{\cos\varphi_1}+\theta_0, \qquad \varphi = y+\varphi_0. \tag{10}$$

An example of panorama images is given in Figure 5.

Figure 5. Panorama images created using point cloud “longdress_vox10_1060.ply” from the equirectangular projection, for the geometry (a) and texture (b).

3.6. Mercator Projection

Transformation equations and the inverse formulas are given in Equations (11) and (12). Problems may arise at latitudes φ near ±90°. Usually, (φ0, θ0) = (0, 0). Similar to the cylindrical projection, later in the experiments we will “compress” all φ latitude angles, multiplying them by the factor 0.825, to create a panorama image (and “decompress” them, dividing by the factor 0.825, to recreate the point clouds).

$$x = \theta-\theta_0, \qquad y = \ln\left(\tan\varphi+\frac{1}{\cos\varphi}\right)-y_0, \qquad y_0 = \ln\left(\tan\varphi_0+\frac{1}{\cos\varphi_0}\right). \tag{11}$$

The inverse formulas are given in Equation (12):

$$\theta = x+\theta_0, \qquad \varphi = 2\arctan\left(e^{y+y_0}\right)-\frac{\pi}{2}. \tag{12}$$
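A minimal C++ sketch of Equations (11) and (12) with the same 0.825 latitude scaling as for the cylindrical projection (names are ours; φ0 = 0 is assumed, which gives y0 = 0):

#include <cmath>

// Mercator projection, Equation (11), with compressed latitude.
void mercatorForward(double theta, double phi, double theta0, double phi0,
                     double& x, double& y) {
    phi *= 0.825;                        // "compress" latitude
    double y0 = std::log(std::tan(phi0) + 1.0 / std::cos(phi0));
    x = theta - theta0;
    y = std::log(std::tan(phi) + 1.0 / std::cos(phi)) - y0;
}

// Inverse projection, Equation (12), with latitude "decompression".
void mercatorInverse(double x, double y, double theta0, double phi0,
                     double& theta, double& phi) {
    double y0 = std::log(std::tan(phi0) + 1.0 / std::cos(phi0));
    theta = x + theta0;
    phi = (2.0 * std::atan(std::exp(y + y0)) - M_PI / 2.0) / 0.825;
}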

An example of panorama images is given in Figure 6.

Figure 6. Panorama images created using point cloud “longdress_vox10_1060.ply” from the Mercator projection, for the geometry (a) and texture (b).

3.7. Miller Projection

The Miller projection is a modified Mercator projection. Transformation equations and the inverse formulas are given in Equations (13) and (14). Problems that may arise using the Mercator projection for latitudes φ near ±90° are not present in the Miller projection. Usually, (φ0, θ0) = (0, 0).

$$x = \theta-\theta_0, \qquad y = \frac{5}{4}\ln\tan\left(\frac{\pi}{4}+\frac{2\varphi}{5}\right)-y_0, \qquad y_0 = \frac{5}{4}\ln\tan\left(\frac{\pi}{4}+\frac{2\varphi_0}{5}\right). \tag{13}$$

The inverse formulas are given in Equation (14):

$$\theta = x+\theta_0, \qquad \varphi = \frac{5}{2}\arctan\left(e^{4(y+y_0)/5}\right)-\frac{5\pi}{8}. \tag{14}$$
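A minimal C++ sketch of Equations (13) and (14) (names are ours; no latitude scaling is needed for this projection):

#include <cmath>

// Miller projection, Equation (13).
void millerForward(double theta, double phi, double theta0, double phi0,
                   double& x, double& y) {
    double y0 = 1.25 * std::log(std::tan(M_PI / 4.0 + 0.4 * phi0));
    x = theta - theta0;
    y = 1.25 * std::log(std::tan(M_PI / 4.0 + 0.4 * phi)) - y0;
}

// Inverse projection, Equation (14).
void millerInverse(double x, double y, double theta0, double phi0,
                   double& theta, double& phi) {
    double y0 = 1.25 * std::log(std::tan(M_PI / 4.0 + 0.4 * phi0));
    theta = x + theta0;
    phi = 2.5 * std::atan(std::exp(0.8 * (y + y0))) - 5.0 * M_PI / 8.0;
}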

An example of panorama images is given in Figure 7.

Figure 7. Panorama images created using point cloud “longdress_vox10_1060.ply” from the Miller projection, for the geometry (a) and texture (b).

3.8. Rectilinear Projection

Transformation equations and the inverse formulas are given in Equations (15) and (16). θ0 and φ1 are the longitude and latitude of the center of the projection. It is recommended to use this projection for horizontal and vertical angles of less than 120°. Therefore, the panorama has to be divided into smaller subsets, i.e., a minimum of 3 (later we will use 4). In addition, for greater vertical angles, all angles are scaled to fit in less than ±90°. Later in the experiments, similar to the cylindrical and Mercator projections, we will “compress” and “decompress” all φ latitude angles, by multiplying and dividing them by 0.825.

$$x = \frac{\cos\varphi\sin(\theta-\theta_0)}{\sin\varphi_1\sin\varphi+\cos\varphi_1\cos\varphi\cos(\theta-\theta_0)}, \qquad y = \frac{\cos\varphi_1\sin\varphi-\sin\varphi_1\cos\varphi\cos(\theta-\theta_0)}{\sin\varphi_1\sin\varphi+\cos\varphi_1\cos\varphi\cos(\theta-\theta_0)}. \tag{15}$$

The inverse formulas are given in Equation (16):

$$\theta = \theta_0+\arctan\frac{x\sin C}{\rho\cos\varphi_1\cos C-y\sin\varphi_1\sin C}, \qquad \varphi = \arcsin\left(\cos C\sin\varphi_1+\frac{y\sin C\cos\varphi_1}{\rho}\right), \qquad \rho = \sqrt{x^2+y^2}, \qquad C = \arctan\rho. \tag{16}$$

An example of panorama images is given in Figure 8.

Figure 8. Panorama images created using point cloud “longdress_vox10_1060.ply” from the rectilinear projection, for the geometry (a) and texture (b).

3.9. Pannini Projection

Transformation equations and the inverse formulas are given in Equations (17) and (18). θ0 and φ1 are the longitude and latitude of the center of the projection. Parameter d can be any non-negative number. For d = 0 we have a rectilinear projection, while d = 1 is the usual Pannini projection, used in the later experiments. It is recommended to use this projection for horizontal and vertical angles of less than 150°. Therefore, the panorama has to be divided into smaller subsets, i.e., a minimum of 3 (later we will use 4). In addition, for greater vertical angles, all angles are scaled to fit in less than ±90°. For this projection type, later in the experiments we will “compress” and “decompress” all φ latitude angles, multiplying and dividing them by the factor 0.825.

$$x = \frac{(d+1)\sin(\theta-\theta_0)}{d+\sin\varphi_1\tan\varphi+\cos\varphi_1\cos(\theta-\theta_0)}, \qquad y = \frac{(d+1)\tan\varphi\left(\cos\varphi_1-\sin\varphi_1\frac{1}{\tan\varphi}\cos(\theta-\theta_0)\right)}{d+\sin\varphi_1\tan\varphi+\cos\varphi_1\cos(\theta-\theta_0)}. \tag{17}$$

The inverse formulas are given in Equation (18):

$$A = \frac{y}{x\cos\varphi_1}, \qquad B = \tan\varphi_1, \qquad C = Ax\sin\varphi_1-d-1, \qquad D = Bx\sin\varphi_1+x\cos\varphi_1, \qquad E = -xd. \tag{18}$$

Finally, we obtain Equation (19):

$$C\sin(\theta-\theta_0)+D\cos(\theta-\theta_0) = E \;\Rightarrow\; \theta = \theta_0+\arccos\frac{E}{\sqrt{C^2+D^2}}+\arctan\frac{C}{D}, \qquad \varphi = \arctan\left(A\sin(\theta-\theta_0)+B\cos(\theta-\theta_0)\right). \tag{19}$$
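A minimal C++ sketch of the inverse Pannini projection, Equations (18) and (19) (names are ours; the sign of E follows from substituting Equation (17) into Equation (18), and x = 0, the central column, must be handled separately):

#include <cmath>

// Inverse Pannini projection: solve C*sin(dt) + D*cos(dt) = E for
// dt = theta - theta0, then recover phi; d = 1 in the later experiments.
void panniniInverse(double x, double y, double theta0, double phi1, double d,
                    double& theta, double& phi) {
    double A = y / (x * std::cos(phi1));  // x = 0 needs special handling
    double B = std::tan(phi1);
    double C = A * x * std::sin(phi1) - d - 1.0;
    double D = B * x * std::sin(phi1) + x * std::cos(phi1);
    double E = -x * d;
    double dt = std::acos(E / std::sqrt(C * C + D * D)) + std::atan2(C, D);
    theta = theta0 + dt;
    phi = std::atan(A * std::sin(dt) + B * std::cos(dt));
}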

An example of panorama images is given in Figure 9.

Figure 9. Panorama images created using point cloud “longdress_vox10_1060.ply” from the Pannini projection, for the geometry (a) and texture (b).

3.10. Stereographic Projection

Transformation equations and the inverse formulas are given in Equations (20) and (21). θ0 and φ1 are the longitude and latitude of the center of the projection. It is advisable to use at most 120° of longitude, i.e., to divide the image into at least 3 subsets (later we will use 4).

$$x = \frac{2R\cos\varphi\sin(\theta-\theta_0)}{1+\sin\varphi_1\sin\varphi+\cos\varphi_1\cos\varphi\cos(\theta-\theta_0)}, \qquad y = \frac{2R\left(\cos\varphi_1\sin\varphi-\sin\varphi_1\cos\varphi\cos(\theta-\theta_0)\right)}{1+\sin\varphi_1\sin\varphi+\cos\varphi_1\cos\varphi\cos(\theta-\theta_0)}. \tag{20}$$

The inverse formulas are given in Equation (21):

$$\theta = \theta_0+\arctan\frac{x\sin C}{\rho\cos\varphi_1\cos C-y\sin\varphi_1\sin C}, \qquad \varphi = \arcsin\left(\cos C\sin\varphi_1+\frac{y\sin C\cos\varphi_1}{\rho}\right), \qquad \rho = \sqrt{x^2+y^2}, \qquad C = 2\arctan\frac{\rho}{2R}. \tag{21}$$

An example of panorama images is given in Figure 10.

Figure 10. Panorama images created using point cloud “longdress_vox10_1060.ply” from the stereographic projection, for the geometry (a) and texture (b).

4. Creation of Panorama Images from Point Clouds and Recreation of Point Clouds

In this section we describe how to calculate panorama images from point clouds, as well as how to recreate point clouds from the obtained panorama images. The programs used are the 3DTK toolkit [13], MeshLab [45], CloudCompare [46], FFmpeg [47] and MATLAB for scripts.

For this example, we will use the Miller projection, although any other projection described earlier can be used in a similar way. Also, we will use the first 20 frames of the Longdress point cloud [44]. This is a voxelized point cloud with a bounding box size of 1024 × 1024 × 1024, i.e., with 10-bit precision per coordinate. Additionally, the texture (color) of each point is represented in 24-bit RGB format, i.e., with 8 bits per color channel. Overall, 3 × 10 + 3 × 8 = 54 bits per point are used in the ideal case; however, depending on the used format, even the binary format might occupy much more space, as we present later.

The input point cloud is read in MATLAB and an offset is added to all its points, so that the bounding box center is at (0, 0, 0). Afterwards, it is scaled so that the maximum distance between any point and the origin (0, 0, 0) does not exceed 2^bit_num − 1 = 65,535 for a 16-bit panorama grayscale image, Equation (22).

$$\mathit{scale\_factor} = \frac{K\cdot\left(2^{\mathit{bit\_num}}-1\right)}{\max\limits_{i}\sqrt{xx(i)^2+yy(i)^2+zz(i)^2}}, \tag{22}$$

where bit_num = 16 (in the later case, but it could also be 24 for some other point clouds), K = 0.01 (due to the later scaling made by the 3DTK toolkit), i ∈ {1, …, N}, N is the number of points, and xx(i), yy(i), zz(i) are the Cartesian coordinates of point i.
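For illustration, a minimal C++ sketch of Equation (22) (the Point type and function name are ours) could be:

#include <algorithm>
#include <cmath>
#include <vector>

// Compute the scale factor of Equation (22) from the maximum point distance
// to the (already centered) origin; bit_num = 16 and K = 0.01 as in the text.
// A non-degenerate point cloud (max_dist > 0) is assumed.
struct Point { double x, y, z; };

double scaleFactor(const std::vector<Point>& pts, int bit_num = 16, double K = 0.01) {
    double max_dist = 0.0;
    for (const Point& p : pts)
        max_dist = std::max(max_dist, std::sqrt(p.x * p.x + p.y * p.y + p.z * p.z));
    return K * (std::pow(2.0, bit_num) - 1.0) / max_dist;
}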

Both geometry and texture are represented using the same projection. In the case of geometry, we use a 16-bit grayscale .png as input for the later compression. However, approximately 9 bits may be enough for the tested point cloud: the maximum distance from the center is 512 before scaling, but the number may not be an integer, so some precision loss may occur for the 9-bit representation. For larger point clouds, a 24-bit representation may also be used, which is also implemented in the 3DTK toolkit, storing geometry in an RGB image [15]. In the case of texture, we additionally create a Voronoi diagram using the OpenCV “distanceTransform” function and the L2 norm (while creating the texture image in the 3DTK toolkit), Listing 1. A Voronoi diagram (tessellation, decomposition) is the partitioning of a plane with n points into convex polygons with exactly one generating point in each polygon, where every point in a particular polygon is closer to its generating point than to any other generating point [48]. By creating a “full” texture image, instead of an originally sparse one, additional video compression efficiency may be achieved later.

Listing 1. Voronoi diagram creation.

//create geometry and texture image using 3DTK
//get mask as the inverse of the existing pixels from geometry
distanceTransform(mask, distance, labels, CV_DIST_L2,
                  CV_DIST_MASK_PRECISE, CV_DIST_LABEL_PIXEL);
//map labels to indices
std::vector<cv::Vec2i> label_to_index;
//reserve memory for faster push_back
label_to_index.reserve(sum(~mask)[0]);
for (int row = 0; row < mask.rows; ++row)
  for (int col = 0; col < mask.cols; ++col)
    if (mask.at<uchar>(row, col) == 0) //this pixel exists
      label_to_index.push_back(cv::Vec2i(row, col));
//create "full" image
for (int row = 0; row < mask.rows; ++row) {
  for (int col = 0; col < mask.cols; ++col) {
    if (mask.at<uchar>(row, col) > 0) { //this pixel needs to be filled
      //copy the color of the nearest existing pixel (its Voronoi generating point)
      colorImage.at<cv::Vec3b>(row, col) =
        colorImage.at<cv::Vec3b>(label_to_index[labels.at<int>(row, col)]);
    }
  }
}

The previously described algorithm is presented in Figure 11, for the tenth point cloud, “longdress_vox10_1060.ply”, and the Miller projection type. The projection area is set to be about 2,000,000 pixels. We define the ratio for the image size as the ratio between the first point cloud's height and width, divided by π; in this example, the ratio is 0.8968. Because the later video compression expects a frame width and height divisible by 8, we also round the final frame width and height to the nearest integer divisible by 8, Equation (23). Again, for the later video compression, all subsequent point clouds need to have the same frame size.

$$\mathit{ratio} = \frac{\mathit{pc\_height}}{\pi\cdot\mathit{pc\_width}}, \qquad \mathit{frame\_width} = 8\cdot\operatorname{round}\left(\frac{1}{8}\sqrt{\frac{\mathit{panorama\_area}}{\mathit{ratio}}}\right), \qquad \mathit{frame\_height} = 8\cdot\operatorname{round}\left(\frac{1}{8}\cdot\mathit{frame\_width}\cdot\mathit{ratio}\right). \tag{23}$$
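For illustration, a minimal C++ sketch of Equation (23) (our names; std::lround performs the rounding to the nearest integer) could be:

#include <cmath>

// Derive a frame width and height divisible by 8 from the target panorama
// area and the height/width ratio of the first point cloud, Equation (23).
void frameSize(double pc_width, double pc_height, double panorama_area,
               int& frame_width, int& frame_height) {
    double ratio = pc_height / (M_PI * pc_width);
    frame_width  = 8 * (int)std::lround(std::sqrt(panorama_area / ratio) / 8.0);
    frame_height = 8 * (int)std::lround(frame_width * ratio / 8.0);
}
// Example from the text: panorama_area = 2,000,000 and ratio = 0.8968
// give frame_width = 1496 and frame_height = 1344.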

Figure 11. Point clouds to panorama, using Miller projection: (a) original point cloud “longdress_vox10_1060.ply” (799,765 points); (b) geometry (1496 × 1344 pixels, 16-bit grayscale); (c) texture (1496 × 1344 pixels, 24-bit RGB); (d) texture with Voronoi (1496 × 1344 pixels, 24-bit RGB).

After the first point cloud has been projected to a 2D panorama, all subsequent point clouds are also projected to geometry and texture panoramas with the same frame size as the first one, with geometry and texture calculated in the same way as described earlier. Afterwards, we use FFmpeg and different video compressions for the geometry and texture images. For texture, we use the x265 coder (H.265/HEVC) with lossy compression, while for geometry we use the FFV1 coder with lossless compression. For x265 (texture) we use crf 17 (constant rate factor), pixel format rgb24 and preset veryslow, while for FFV1 (geometry) we use pixel format gray9le (a depth of 9 bits per pixel), as well as gray10le (only in the case of the Miller projection). For this case, the texture file size is 11.3 × 10^6 bytes and the geometry file size is 14 × 10^6 bytes, so the overall size is 25.3 × 10^6 bytes.
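For illustration, hypothetical FFmpeg invocations matching these settings could look as follows (file names and frame rate are placeholders, color-format options are omitted for brevity, and the exact scripts are available at [49]):

# texture: x265 (H.265/HEVC), lossy, crf 17, preset veryslow
ffmpeg -framerate 30 -i texture_%04d.png -c:v libx265 -crf 17 -preset veryslow texture.mkv
# geometry: FFV1, lossless, 9-bit grayscale
ffmpeg -framerate 30 -i geometry_%04d.png -c:v ffv1 -pix_fmt gray9le geometry.mkv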

Point cloud recreation is done by decompressing the previously compressed video files and afterwards using the inverse formulas of the used projection type, in this example Miller. For the color information, we use those pixels of the color panorama image (Voronoi diagram) that exist in the geometry panorama image. In the final step, we use several algorithms to oversample the recreated point cloud and fill holes that may exist in some parts of it:

  • Normal estimation using CloudCompare version 2.11.1 x64 [46], for the later screened Poisson reconstruction, in “command line” mode. The specific parameters used were: -OCTREE_NORMALS auto -ORIENT PLUS_ZERO -MODEL TRI -ORIENT_NORMS_MST 8 -ORIENT_NORMS_MST 4.

  • Surface Reconstruction: Screened Poisson filter using MeshLab 2020.09 [45]: reconstruction depth 11 and other default values.

  • Poisson-disk Sampling filter using MeshLab: using the same number of samples as the original point cloud, Monte Carlo OverSampling of 20 and other default values. However, usually a somewhat higher number of output points was created.

  • Vertex Attribute Transfer filter using MeshLab: using default values.

After the point cloud has been recreated, we need to scale it to its initial size and translate it back to its original position. These 4 numbers (1 float for scaling and 3 floats for translation in each direction) have to be transmitted as well. The original point cloud snapshot and the point cloud snapshots before and after the Poisson reconstruction algorithm are presented in Figure 12.

Figure 12. Point clouds to panorama, using Miller projection and 9-bit range depth: (a) Original point cloud “longdress_vox10_1060.ply” (799,765 points; point size 1 in MeshLab); (b) Decompressed point cloud after Poisson reconstruction (1,161,124 points; point size 1 in MeshLab); (c) Decompressed point cloud before Poisson reconstruction (430,793 points; point size 1 in MeshLab); (d) Decompressed point cloud before Poisson reconstruction (430,793 points; point size 2 in MeshLab).

The 20 tested input point clouds have 15,900,190 points overall, on average 795,010 points per point cloud, while the overall compressed file size is 25.3 × 10^6 B. In this case we use approximately 25.3 × 10^6 × 8/(20 × 795,010) = 12.7 bits per (input) point or, if compared with the output number of points, approximately 25.3 × 10^6 × 8/(20 × 1,154,324) = 8.8 bits per output point. However, the Poisson reconstruction algorithm at the end of the decompression may produce an even higher number of output points (in this case the average number of points is 1,154,324) than the input number of points, so the number of bits per output point may be misleading. Because of that, we later report only bits per input point. The Poisson reconstruction step is an important part of the proposed algorithm, cf. Figure 12. The scripts used can be found on the web page [49].

5. Results

5.1. Objective Measures Used for Point Cloud Performance Comparison

Recently, several objective measures have been proposed, based on the geometry and/or attribute information of the tested point clouds [50]. For geometry, two different groups of measures have been proposed: point-to-point (p2p) and point-to-plane (p2pl) distances [51]. Later in the paper, as point-to-point measures we will use the rmsFp2p, rmsFPSNR1p2p and rmsFPSNR2p2p measures, defined as:

$$\mathit{rmsF}_{p2p} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\lVert E_{i,j}\rVert_2^2} \tag{24}$$

$$\mathit{rmsFPSNR1}_{p2p} = 10\log_{10}\frac{\mathit{MAX\_DIST}^2}{\left(\mathit{rmsF}_{p2p}\right)^2} \tag{25}$$

$$\mathit{rmsFPSNR2}_{p2p} = 10\log_{10}\frac{3c^2}{\left(\mathit{rmsF}_{p2p}\right)^2}, \qquad c = 2^d-1 \tag{26}$$

In Equation (24), Ei,j is defined as the difference vector (or point to nearest point vector) between an arbitrary point i from the first point cloud and the corresponding nearest point j from the second point cloud. The first and second point clouds are firstly the original and the degraded one, and then vice versa. The final (symmetric) measure is calculated as the measure with the worse (higher RMS) score.

In Equation (25), MAX_DIST is the maximal distance between all pairs of closest points in the first and second point cloud, as defined in [51]. In Equation (26), c is the peak constant value, depending on the point cloud coordinate precision d (e.g., d = 10 for 10-bit depth precision), as used in [19] during the development of the MPEG standard.

Similar to the point-to-point measures, point-to-plane measures are defined as:

$$\mathit{rmsF}_{p2pl} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(\langle E_{i,j}, N_j\rangle\right)^2} \tag{27}$$

$$\mathit{rmsFPSNR1}_{p2pl} = 10\log_{10}\frac{\mathit{MAX\_DIST}^2}{\left(\mathit{rmsF}_{p2pl}\right)^2} \tag{28}$$

$$\mathit{rmsFPSNR2}_{p2pl} = 10\log_{10}\frac{3c^2}{\left(\mathit{rmsF}_{p2pl}\right)^2}, \qquad c = 2^d-1 \tag{29}$$

In Equation (27), Ei,j is defined similarly as for the p2p measures. Nj is the unit normal vector, calculated for each point in the first point cloud. <Ei,j, Nj> is the dot product between the error vector Ei,j and the normal vector Nj, yielding the projected error vector. In Equation (28), the parameter MAX_DIST is the same as in Equation (25), and in Equation (29), the parameters c and d are the same as in Equation (26).
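For illustration, a minimal C++ sketch of the one-directional rmsF computations from Equations (24) and (27) is given below (brute-force nearest neighbor search for clarity; a k-d tree would be used in practice; types and names are ours, and normalsA holds the unit normals of the first cloud, following the text):

#include <cmath>
#include <limits>
#include <vector>

struct Pt { double x, y, z; };

// One-directional rmsF (A towards B); the symmetric measure repeats this
// with the clouds swapped and keeps the worse (higher) value.
void rmsF(const std::vector<Pt>& A, const std::vector<Pt>& B,
          const std::vector<Pt>& normalsA,
          double& rms_p2p, double& rms_p2pl) {
    double sum_p2p = 0.0, sum_p2pl = 0.0;
    for (size_t i = 0; i < A.size(); ++i) {
        size_t j = 0;
        double best = std::numeric_limits<double>::max();
        for (size_t k = 0; k < B.size(); ++k) {   // nearest point of B
            double dx = A[i].x - B[k].x, dy = A[i].y - B[k].y, dz = A[i].z - B[k].z;
            double d2 = dx * dx + dy * dy + dz * dz;
            if (d2 < best) { best = d2; j = k; }
        }
        sum_p2p += best;                          // ||E_{i,j}||^2
        double ex = A[i].x - B[j].x, ey = A[i].y - B[j].y, ez = A[i].z - B[j].z;
        double dot = ex * normalsA[i].x + ey * normalsA[i].y + ez * normalsA[i].z;
        sum_p2pl += dot * dot;                    // <E_{i,j}, N>^2
    }
    rms_p2p  = std::sqrt(sum_p2p  / A.size());
    rms_p2pl = std::sqrt(sum_p2pl / A.size());
}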

In the next subsection, point-to-point and point-to-plane measures will be calculated using the software presented in [51].

5.2. Point Cloud Compression Using Different Projections—Compression Efficiency

Table 1, Table 2 and Table 3 present a basic overview of the compression efficiency of point clouds with different projection areas: 1056 × 944 pixels (approximately 1,000,000), 1296 × 1160 pixels (approximately 1,500,000) and 1496 × 1344 pixels (approximately 2,000,000), respectively. Separately, we calculated bpp (bits per input point) and Mbps (bpp × average number of input points per point cloud × 30 point clouds per second / 10^6) for the color, range and color+range compressed video files. Results from all three tables are also summarized in Figure 13, which represents the average ratio between the compressed video file sizes (color and range), created from the panorama images, and the input file sizes, using point cloud Longdress. A higher ratio does not represent a better case (i.e., better objective scores), but only a higher number of used bits per input point for the same panorama size. It can be concluded that the equirectangular, Miller and Mercator projections have the highest bpp for the tested point cloud Longdress.
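As a worked check of this definition, for the azimuthal projection in Table 1 (range+color, 5.2803 bpp):

$$\mathrm{Mbps} = \frac{5.2803\cdot 795{,}010\cdot 30}{10^6} \approx 125.94,$$

which matches the corresponding table entry.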

Table 1. Information about the compressed file sizes for different projection types, with a projection area of 1056 × 944 pixels (approximately 1,000,000 pixels).

1056 × 944 Pixels (Approximately 1,000,000) Azimuthal Conic Cylindrical Equalareacylindrical Equirectangular Mercator Miller 10-Bit Miller 9-Bit Pannini Rectilinear Stereographic
Compressed color average bits per input point (bpp): 2.2018 2.4025 3.0207 2.7203 2.8726 2.9612 2.9618 2.9618 2.826 2.63 2.3026
Compressed range average bits per input point (bpp): 3.0784 2.8992 3.4701 3.6785 4.074 3.9306 4.7362 3.9652 3.1382 2.7561 3.0313
Compressed range+color average bits per input point (bpp): 5.2803 5.3016 6.4908 6.3987 6.9467 6.8918 7.698 6.927 5.9642 5.386 5.3339
Input average bits per input point (binary format) (bpp): 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017
Ratio (compressed bpp)/(input bpp) × 100%: 4.4001 4.418 5.409 5.3322 5.7888 5.7431 6.4149 5.7724 4.9701 4.4883 4.4448
Compressed video color (Mbps): 52.514 57.2993 72.0449 64.8792 68.5135 70.626 70.6387 70.6387 67.3999 62.7257 54.9167
Compressed video range (Mbps): 73.4217 69.146 82.7638 87.7325 97.1667 93.7467 112.9608 94.5713 74.8475 65.7331 72.2982
Compressed video range+color (Mbps): 125.9357 126.4452 154.8087 152.6117 165.6802 164.3727 183.5995 165.2101 142.2474 128.4588 127.2149
Input point cloud (binary format) (Mbps): 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774
Ratio (compressed Mbps)/(input Mbps) × 100%: 4.4001 4.4180 5.4090 5.3322 5.7888 5.7431 6.4149 5.7724 4.9701 4.4883 4.4448

Table 2. Information about the compressed file sizes for different projection types, with a projection area of 1296 × 1160 pixels (approximately 1,500,000 pixels).

1296 × 1160 Pixels (Approximately 1,500,000) Azimuthal Conic Cylindrical Equalareacylindrical Equirectangular Mercator Miller 10-Bit Miller 9-Bit Pannini Rectilinear Stereographic
Compressed color average bits per input point (bpp): 3.2534 3.4712 4.3975 3.9488 4.2032 4.4046 4.3085 4.3085 4.1135 3.8306 3.4746
Compressed range average bits per input point (bpp): 4.4431 4.1957 4.7533 5.1534 5.7746 5.5972 6.6708 5.6485 4.3288 3.8173 4.5586
Compressed range+color average bits per input point (bpp): 7.6965 7.6669 9.1508 9.1022 9.9778 10.0018 10.9793 9.957 8.4423 7.6479 8.0332
Input average bits per input point (binary format) (bpp): 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017
Ratio (compressed bpp)/(input bpp) × 100%: 6.4136 6.389 7.6255 7.585 8.3147 8.3347 9.1493 8.2974 7.0352 6.3732 6.6942
Compressed video color (Mbps): 77.5949 82.7901 104.881 94.18 100.2465 105.0509 102.7599 102.7599 98.109 91.36 82.8696
Compressed video range (Mbps): 105.9685 100.0686 113.3672 122.9091 137.726 133.4953 159.1008 134.7181 103.2435 91.0447 108.7247
Compressed video range+color (Mbps): 183.5634 182.8587 218.2482 217.0891 237.9725 238.5462 261.8607 237.4779 201.3525 182.4047 191.5943
Input point cloud (binary format) (Mbps): 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774
Ratio (compressed Mbps)/(input Mbps) × 100%: 6.4136 6.3890 7.6255 7.5850 8.3147 8.3347 9.1493 8.2974 7.0352 6.3732 6.6942

Table 3. Information about the compressed file sizes for different projection types, with a projection area of 1496 × 1344 pixels (approximately 2,000,000 pixels).

1496 ×1344 Pixels (Approximately 2,000,000) Azimuthal Conic Cylindrical Equalareacylindrical Equirectangular Mercator Miller 10-Bit Miller 9-Bit Pannini Rectilinear Stereographic
Compressed color average bits per input point (bpp): 4.2758 4.5804 5.6975 5.1732 5.5685 5.675 5.669 5.669 5.3434 4.9298 4.6411
Compressed range average bits per input point (bpp): 5.5811 5.3118 5.8544 6.3141 7.1576 6.9908 8.2718 7.0543 5.348 4.7428 5.8841
Compressed range+color average bits per input point (bpp): 9.8569 9.8922 11.5519 11.4873 12.7261 12.6657 13.9409 12.7233 10.6913 9.6726 10.5252
Input average bits per input point (binary format) (bpp): 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017 120.0017
Ratio (compressed bpp)/(input bpp) × 100%: 8.2139 8.2434 9.6265 9.5726 10.6049 10.5546 11.6172 10.6026 8.9093 8.0604 8.7709
Compressed video color (Mbps): 101.9793 109.2427 135.8868 123.3824 132.8104 135.3498 135.2079 135.2079 127.4406 117.5775 110.6906
Compressed video range (Mbps): 133.1103 126.6887 139.6304 150.594 170.7107 166.7318 197.286 168.2461 127.5504 113.1165 140.3382
Compressed video range+color (Mbps): 235.0896 235.9314 275.5172 273.9764 303.5211 302.0816 332.4939 303.454 254.991 230.6941 251.0288
Input point cloud (binary format) (Mbps): 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774 2862.0774
Ratio (compressed Mbps)/(input Mbps) × 100%: 8.2139 8.2434 9.6265 9.5726 10.6049 10.5546 11.6172 10.6026 8.9093 8.0604 8.7709

Figure 13. Average ratio between the compressed video file sizes (color and range), created from the panorama images, and the input file sizes, using point cloud Longdress. The ratio is presented for all cases, i.e., for 1,000,000, 1,500,000 and 2,000,000 pixels.

5.3. Point Cloud Compression Using Different Projections—Objective Measures

This section compares the different projection methods with three different panorama sizes, using the previously described point-to-point and point-to-plane objective measures. Specifically, Table 4, Table 5 and Table 6 present objective measures before Poisson reconstruction, while Table 7, Table 8 and Table 9 present objective measures after Poisson reconstruction. The best values are bold in all tables. Figure 14 presents the point-to-point measure separately for each point cloud, for the cylindrical, equirectangular, Mercator, Miller 10-bit and Miller 9-bit projections, before and after Poisson reconstruction, for all three panorama sizes, while Figure 15 similarly presents the point-to-plane measure. Finally, Figure 16 compares the 9-bit and 10-bit Miller projections using the point-to-point and point-to-plane measures, before and after Poisson reconstruction.

Table 4. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, before Poisson reconstruction, using different projections with a projection area of 1056 × 944 pixels (approximately 1,000,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 4.3569 58.5848 −5.7881 1.1374 64.4248 5.8920 795,010 260,522 32.7697
Conic 3.8489 59.1248 −4.7080 1.0048 64.9778 6.9979 795,010 250,454 31.5033
Cylindrical 1.6901 62.7506 2.5435 0.3878 69.1378 15.3180 795,010 318,703 40.0879
Equalareacylindrical 3.2290 59.8909 −3.1758 0.7819 66.0874 9.2171 795,010 283,737 35.6897
Equirectangular 1.8139 62.4501 1.9426 0.4156 68.8628 14.7679 795,010 333,865 41.9951
Mercator 1.6177 62.9459 2.9342 0.3677 69.3777 15.7978 795,010 344,682 43.3557
Miller 10-bit 1.5704 63.0796 3.2017 0.3505 69.5962 16.2347 795,010 345,223 43.4237
Miller 9-bit 1.6200 62.9388 2.9200 0.3686 69.3682 15.7787 795,010 345,223 43.4237
Pannini 1.7741 62.5377 2.1177 0.4092 68.9031 14.8485 795,010 291,890 36.7153
Rectilinear 1.9423 62.1427 1.3278 0.4655 68.3559 13.7542 795,010 260,368 32.7503
Stereographic 3.6261 59.3823 −4.1930 0.8135 65.9116 8.8655 795,010 283,551 35.6663

Table 5. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, before Poisson reconstruction, using different projections with a projection area of 1296 × 1160 pixels (approximately 1,500,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 3.4281 59.6300 −3.6976 0.8209 65.8651 8.7725 795,010 313,372 39.4174
Conic 3.0146 60.1911 −2.5754 0.7793 66.0917 9.2258 795,010 300,309 37.7742
Cylindrical 1.1212 64.5158 6.0740 0.2697 70.6845 18.4113 795,010 389,681 49.0159
Equalareacylindrical 2.4257 61.1371 −0.6835 0.5808 67.3735 11.7893 795,010 329,721 41.4738
Equirectangular 1.1851 64.2722 5.5868 0.2797 70.5251 18.0925 795,010 390,058 49.0633
Mercator 1.0756 64.6961 6.4347 0.2590 70.8577 18.7578 795,010 406,907 51.1826
Miller 10-bit 1.0077 64.9852 7.0127 0.2333 71.3168 19.6760 795,010 406,868 51.1777
Miller 9-bit 1.0758 64.6942 6.4308 0.2593 70.8521 18.7466 795,010 406,868 51.1777
Pannini 1.1853 64.2675 5.5774 0.2849 70.4427 17.9277 795,010 359,861 45.2650
Rectilinear 1.3303 63.7622 4.5667 0.3185 69.9618 16.9659 795,010 324,548 40.8231
Stereographic 2.5708 60.8974 −1.1628 0.5539 67.6207 12.2838 795,010 343,921 43.2600

Table 6. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, before Poisson reconstruction, using different projections with a projection area of 1496 × 1344 pixels (approximately 2,000,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 2.4840 61.0363 −0.8851 0.5858 67.3331 11.7085 795,010 351,469 44.2094
Conic 2.4327 61.1258 −0.7061 0.6254 67.0423 11.1270 795,010 334,183 42.0351
Cylindrical 0.8799 65.5575 8.1574 0.2266 71.4296 19.9015 795,010 441,458 55.5286
Equalareacylindrical 1.8918 62.2127 1.4677 0.4537 68.4265 13.8954 795,010 361,188 45.4319
Equirectangular 0.9269 65.3283 7.6989 0.2354 71.2633 19.5689 795,010 428,221 53.8636
Mercator 0.8436 65.7380 8.5184 0.2199 71.5590 20.1603 795,010 449,793 56.5770
Miller 10-bit 0.7681 66.1541 9.3506 0.1907 72.1825 21.4073 795,010 449,260 56.5100
Miller 9-bit 0.8462 65.7263 8.4950 0.2203 71.5502 20.1427 795,010 449,260 56.5100
Pannini 0.9405 65.2639 7.5701 0.2402 71.1749 19.3922 795,010 410,965 51.6931
Rectilinear 1.0556 64.7615 6.5653 0.2653 70.7451 18.5326 795,010 374,173 47.0652
Stereographic 1.8777 62.2770 1.5964 0.4012 69.0208 15.0840 795,010 386,019 48.5552

Table 7. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, after Poisson reconstruction, using different projections with a projection area of 1056 × 944 pixels (approximately 1,000,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 3.9483 59.2665 −4.4246 3.7950 59.4562 −4.0451 795,010 1,153,609 145.1062
Conic 3.6778 60.0292 −2.8992 3.4638 60.3884 −2.1809 795,010 1,153,016 145.0316
Cylindrical 1.8932 63.1058 3.2539 1.7200 64.0822 5.2069 795,010 1,155,024 145.2842
Equalareacylindrical 3.6355 59.5268 −3.9040 3.5119 59.6839 −3.5898 795,010 1,153,801 145.1304
Equirectangular 1.7756 63.3958 3.8340 1.6544 64.0645 5.1714 795,010 1,154,894 145.2679
Mercator 1.6122 63.7848 4.6119 1.4535 64.7498 6.5421 795,010 1,154,705 145.2441
Miller 10-bit 1.7871 63.3660 3.7744 1.6477 64.1868 5.4161 795,010 1,156,485 145.4680
Miller 9-bit 1.8834 63.1429 3.3282 1.7325 64.0029 5.0481 795,010 1,155,006 145.2819
Pannini 2.1112 62.6956 2.4336 1.9620 63.4910 4.0243 795,010 1,155,073 145.2904
Rectilinear 2.0828 62.6544 2.3512 1.9141 63.4887 4.0197 795,010 1,154,906 145.2694
Stereographic 3.5426 59.6347 −3.6883 3.4265 59.7821 −3.3934 795,010 1,155,261 145.3140

Table 8. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, after Poisson reconstruction, using different projections with a projection area of 1296 × 1160 pixels (approximately 1,500,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 3.4226 59.8245 −3.3087 3.2923 60.0029 −2.9519 795,010 1,153,402 145.0802
Conic 3.0916 60.6366 −1.6844 2.9537 60.8878 −1.1820 795,010 1,152,742 144.9972
Cylindrical 1.3590 65.7201 8.4825 1.2248 67.0806 11.2035 795,010 1,154,542 145.2236
Equalareacylindrical 3.3320 60.1398 −2.6780 3.2203 60.3042 −2.3493 795,010 1,153,681 145.1153
Equirectangular 0.9471 66.7002 10.4428 0.8230 67.9619 12.9661 795,010 1,154,483 145.2162
Mercator 0.6504 68.1203 13.2829 0.5124 69.8651 16.7726 795,010 1,154,466 145.2140
Miller 10-bit 0.8328 67.7035 12.4494 0.7035 69.2895 15.6213 795,010 1,156,160 145.4271
Miller 9-bit 0.9335 67.2952 11.6327 0.8016 68.8654 14.7731 795,010 1,154,383 145.2036
Pannini 1.1857 65.9413 8.9250 1.0427 67.3226 11.6876 795,010 1,154,553 145.2250
Rectilinear 1.3954 64.9695 6.9813 1.2327 66.2226 9.4875 795,010 1,154,607 145.2318
Stereographic 2.4495 61.1509 −0.6558 2.3639 61.3091 −0.3394 795,010 1,155,021 145.2838

Table 9. Point-to-point (p2p) and point-to-plane (p2pl) objective measures rmsF, rmsFPSNR1 and rmsFPSNR2, after Poisson reconstruction, using different projections with a projection area of 1496 × 1344 pixels (approximately 2,000,000 pixels); best values are bold; average input and output number of points and their ratio.

rmsFp2p rmsFPSNR1p2p rmsFPSNR2p2p rmsFp2pl rmsFPSNR1p2pl rmsFPSNR2p2pl Average Input Points Average Output Points Output/Input·100%
Azimuthal 3.5851 59.9790 −2.9997 3.4530 60.1616 −2.6345 795,010 1,153,094 145.0414
Conic 4.0596 59.8137 −3.3304 3.9061 60.0175 −2.9226 795,010 1,152,734 144.9962
Cylindrical 0.3314 69.8545 16.7514 0.2215 71.6639 20.3702 795,010 1,154,352 145.1997
Equalareacylindrical 3.1122 60.9044 −1.1489 3.0077 61.0880 −0.7816 795,010 1,153,599 145.1050
Equirectangular 0.9305 67.6623 12.3669 0.8296 69.0639 15.1701 795,010 1,154,520 145.2208
Mercator 0.7053 68.7747 14.5918 0.6102 70.3353 17.7129 795,010 1,154,439 145.2106
Miller 10-bit 0.4555 69.8594 16.7611 0.3619 71.5899 20.2221 795,010 1,156,008 145.4080
Miller 9-bit 0.4522 69.8166 16.6757 0.3492 71.6585 20.3594 795,010 1,154,324 145.1962
Pannini 0.5384 68.7913 14.6249 0.4139 70.5697 18.1817 795,010 1,154,358 145.2004
Rectilinear 1.2472 66.2188 9.4800 1.1264 67.5502 12.1428 795,010 1,154,410 145.2070
Stereographic 2.8045 61.1347 −0.6883 2.6960 61.3675 −0.2226 795,010 1,155,019 145.2836

Figure 14. Point-to-point objective measure per point cloud, using cylindrical, equirectangular, Mercator, Miller 10-bit and Miller 9-bit projections: (a) before Poisson reconstruction, projection area 1056 × 944 pixels (approximately 1,000,000); (b) after Poisson reconstruction, projection area 1056 × 944 pixels (approximately 1,000,000); (c) before Poisson reconstruction, projection area 1296 × 1160 pixels (approximately 1,500,000); (d) after Poisson reconstruction, projection area 1296 × 1160 pixels (approximately 1,500,000); (e) before Poisson reconstruction, projection area 1496 × 1344 pixels (approximately 2,000,000); (f) after Poisson reconstruction, projection area 1496 × 1344 pixels (approximately 2,000,000).

Figure 15. Point-to-plane objective measure per point cloud, using cylindrical, equirectangular, Mercator, Miller 10-bit and Miller 9-bit projections: (a) before Poisson reconstruction, projection area 1056 × 944 pixels (approximately 1,000,000); (b) after Poisson reconstruction, projection area 1056 × 944 pixels (approximately 1,000,000); (c) before Poisson reconstruction, projection area 1296 × 1160 pixels (approximately 1,500,000); (d) after Poisson reconstruction, projection area 1296 × 1160 pixels (approximately 1,500,000); (e) before Poisson reconstruction, projection area 1496 × 1344 pixels (approximately 2,000,000); (f) after Poisson reconstruction, projection area 1496 × 1344 pixels (approximately 2,000,000).

Figure 16. Objective measure per point cloud, using the Miller projection with 9-bit depth and 10-bit depth: (a) before Poisson reconstruction, point-to-point measure; (b) after Poisson reconstruction, point-to-point measure; (c) before Poisson reconstruction, point-to-plane measure; (d) after Poisson reconstruction, point-to-plane measure.

5.4. Point Cloud Compression—Timing Performance

In this section we present the timing performance for the case of the Miller 9-bit projection with a panorama size of 2,000,000 pixels, Table 10. The point cloud to panorama and panorama to point cloud timings are averaged across 20 point clouds, while for video compression and decompression we use FFmpeg and the different video compressions for geometry and texture images described earlier: for texture, the x265 coder (H.265/HEVC) with lossy compression (crf 17, pixel format rgb24, preset veryslow), and for geometry, the FFV1 coder with lossless compression (pixel format gray9le, i.e., a depth of 9 bits per pixel). The computer configuration is: Intel i7-4770 @ 3.40 GHz, 16 GB RAM, using a virtual machine on Windows 7 x64, running Ubuntu x64 18.04 LTS. It can be seen that the Poisson reconstruction and upsampling process takes most of the overall time, compared with the other steps. Minor steps in between were not taken into account (creating readable point clouds for CloudCompare and MeshLab by adding a header to the point clouds created by 3DTK, and copying files). Due to the overall timing, we did not test all 300 point clouds in the Longdress sequence, but used the first 20 point clouds. However, we expect similar performance for the other point clouds as well.

Table 10. Timing performance, for the Longdress point cloud and the Miller 9-bit projection with a panorama size of 2,000,000 pixels.

Timing Performance Seconds per Point Cloud Seconds per 20 Point Clouds
Point cloud to panorama: 1.9030 38.0603
Compression (texture): - 117.3344
Decompression (texture): - 2.0918
Compression (geometry): - 0.9477
Decompression (geometry): - 1.2366
Panorama to point cloud: 4.9350 98.6991
Normal calculation (CloudCompare): 6.2539 125.0783
Poisson reconstruction and upsampling (MeshLab): 97.0496 1941.0

5.5. Point Cloud Compression Using Different Point Clouds

In this section we present results for two different dynamic point clouds, Redandblack and Soldier (Figure 17), from the dynamic point cloud dataset [44]. Similar to the previous test cases, we used the first 20 point clouds (“redandblack_vox10_1450.ply”–“redandblack_vox10_1469.ply” and “soldier_vox10_0536.ply”–“soldier_vox10_0555.ply”). A panorama image size of approximately 2,000,000 pixels and the cylindrical, Mercator and Miller 9-bit projection types have been used. Results for the point-to-point and point-to-plane measures are shown in Figure 18 for the Redandblack and Figure 19 for the Soldier point clouds.

Figure 17. Tested point clouds: (a) Redandblack, “redandblack_vox10_1450.ply”; (b) Soldier, “soldier_vox10_0536.ply”.

Figure 18. Point-to-point and point-to-plane objective measures (rmsF) per Redandblack point cloud, using cylindrical, Mercator and Miller 9-bit depth projections with approximately 2,000,000 pixels: (a) point-to-point, before Poisson reconstruction; (b) point-to-point, after Poisson reconstruction; (c) point-to-plane, before Poisson reconstruction; (d) point-to-plane, after Poisson reconstruction.

Figure 19. Point-to-point and point-to-plane objective measures (rmsF) per Soldier point cloud, using cylindrical, Mercator and Miller 9-bit depth projections with approximately 2,000,000 pixels: (a) point-to-point, before Poisson reconstruction; (b) point-to-point, after Poisson reconstruction; (c) point-to-plane, before Poisson reconstruction; (d) point-to-plane, after Poisson reconstruction.

Figure 20 shows the second decompressed Soldier point cloud, “soldier_vox10_0537.ply”, using the cylindrical projection with a panorama size of 2,000,000 pixels. In Figure 20a we do not use Poisson reconstruction, while in Figure 20b we use the Poisson reconstruction algorithm described earlier. It can be seen that in Figure 20a there are missing points on the left leg, which creates wrongly oriented normals, and that in Figure 20b there exists an artifact not present in the original point cloud. Due to this error, there is a noticeable increase (lower quality) of the point-to-point and point-to-plane measures, as can be seen in Figure 19b,d, for point cloud number 2 and the cylindrical projection (blue color).

Figure 20. Decompressed point cloud Soldier, “soldier_vox10_0537.ply”, the second point cloud used in the compression/decompression process, using the cylindrical projection with a panorama size of 2,000,000 pixels: (a) before Poisson reconstruction (point size 2 in MeshLab); (b) after Poisson reconstruction (point size 1 in MeshLab).

5.6. Comparison with Octree Reduction from 3DTK Toolkit

In this section we present results using the 3DTK toolkit and its octree reduction method, for the previously tested point clouds: Longdress (Table 11 and Table 12), Redandblack (Table 13 and Table 14) and Soldier (Table 15 and Table 16). Generally, parameter “R” turns on octree-based point reduction with voxels of size R³, while parameter “O” enables randomized octree-based point reduction with O points per voxel. We used the “scan_red” and “show” programs from the 3DTK toolkit to create reduced point clouds: “scan_red” to create decompressed octree-based reduced point clouds and “show” to create compressed .oct files. With increasing “R” we create a lower number of points, while with increasing “O” we create a higher number of output points. The decompressed point clouds have been compared with the original point clouds using the previously described point-to-point and point-to-plane metrics. The average size of the output .oct file is also reported, as well as the bits per input point (bpp).
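For illustration, a minimal C++ sketch of such a randomized voxel-based reduction (not the actual 3DTK implementation, which uses a real octree; packing the voxel indices into a 64-bit hash key is an assumption made for brevity) could be:

#include <algorithm>
#include <cmath>
#include <cstdint>
#include <random>
#include <unordered_map>
#include <vector>

// Bucket points into voxels of side length R and randomly keep at most O
// points per voxel, mimicking the "R"/"O" parameters described above.
struct P3 { double x, y, z; };

std::vector<P3> reduce(const std::vector<P3>& pts, double R, size_t O) {
    std::unordered_map<uint64_t, std::vector<P3>> voxels;
    for (const P3& p : pts) {
        // pack the (possibly negative) voxel indices into one key, 21 bits per axis
        uint64_t ix = (uint64_t)(int64_t)std::floor(p.x / R) & 0x1FFFFF;
        uint64_t iy = (uint64_t)(int64_t)std::floor(p.y / R) & 0x1FFFFF;
        uint64_t iz = (uint64_t)(int64_t)std::floor(p.z / R) & 0x1FFFFF;
        voxels[(ix << 42) | (iy << 21) | iz].push_back(p);
    }
    std::mt19937 rng(42);                       // fixed seed for reproducibility
    std::vector<P3> out;
    for (auto& kv : voxels) {
        std::vector<P3>& v = kv.second;
        std::shuffle(v.begin(), v.end(), rng);  // randomized selection per voxel
        for (size_t i = 0; i < v.size() && i < O; ++i) out.push_back(v[i]);
    }
    return out;
}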

Table 11. Octree reduction using the 3DTK toolkit, with voxel size “R” = 1 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Longdress point cloud.

Longdress, R = 1 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 230,014 427,083 575,585 698,867 761,164 793,732 794,969 795,009
Average size of .oct file, bytes: 3,684,335 6,837,436 9,213,473 11,185,991 12,182,742 12,703,824 12,723,616 12,724,261
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 37.0746 68.8035 92.7130 112.5620 122.5921 127.8356 128.0348 128.0413
rmsFp2p: 0.7755 0.4700 0.2766 0.1209 0.0426 0.0016 0.0001 0.0000
rmsFPSNR1p2p: 66.0727 68.2475 70.5507 74.1436 78.6783 92.9780 108.5374 Inf
rmsFPSNR2p2p: 9.1877 13.5374 18.1437 25.3295 34.3989 62.9984 94.1172 Inf
rmsFp2pl: 0.2547 0.1700 0.1012 0.0499 0.0178 0.0007 0.0000 0.0000
rmsFPSNR1p2pl: 70.9081 72.6655 74.9173 77.9851 82.4677 96.4049 112.7170 Inf
rmsFPSNR2p2pl: 18.8587 22.3734 26.8770 33.0126 41.9778 69.8522 102.4764 Inf

Table 12. Octree reduction using the 3DTK toolkit, with voxel size “R” = 2 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Longdress point cloud.

Longdress, R = 2 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 60,161 116,599 170,613 222,186 270,673 317,488 362,572 406,039
Average size of .oct file, bytes: 966,646 1,869,673 2,733,914 3,559,090 4,334,882 5,083,923 5,805,268 6,500,740
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 9.7271 18.8141 27.5107 35.8143 43.6209 51.1583 58.4171 65.4154
rmsFp2p: 1.5428 1.1679 0.9701 0.8358 0.7339 0.6492 0.5757 0.5099
rmsFPSNR1p2p: 63.0855 64.2946 65.1006 65.7477 66.3126 66.8453 67.3669 67.8937
rmsFPSNR2p2p: 3.2134 5.6316 7.2437 8.5378 9.6676 10.7330 11.7762 12.8298
rmsFp2pl: 0.3484 0.3121 0.2846 0.2599 0.2369 0.2152 0.1945 0.1748
rmsFPSNR1p2pl: 69.5486 70.0257 70.4259 70.8213 71.2238 71.6410 72.0803 72.5437
rmsFPSNR2p2pl: 16.1395 17.0939 17.8941 18.6850 19.4899 20.3244 21.2030 22.1299

Table 13. Octree reduction using the 3DTK toolkit, with voxel size “R” = 1 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Redandblack point cloud.

Redandblack, R = 1 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 211,582 389,317 517,726 620,888 672,157 699,481 701,144 701,234
Average size of .oct file, bytes: 3,389,106 6,232,878 8,287,429 9,938,021 10,758,325 11,195,502 11,222,110 11,223,546
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 34.1038 62.7200 83.3945 100.0040 108.2585 112.6577 112.9255 112.9399
Symmetric rmsF p2p: 0.7591 0.4515 0.2622 0.1146 0.0415 0.0025 0.0001 0.0000
Symmetric PSNR_1 p2p: 66.1656 68.4226 70.7825 74.3788 78.7935 90.9982 103.9500 Inf
Symmetric PSNR_2 p2p: 10.4646 14.9786 19.6983 26.8910 35.7204 60.1298 86.0333 Inf
Symmetric rmsF p2pl: 0.2497 0.1630 0.0955 0.0472 0.0173 0.0011 0.0000 0.0000
Symmetric PSNR_1 p2pl: 70.9956 72.8476 75.1677 78.2340 82.5916 94.4710 108.0630 Inf
Symmetric PSNR_2 p2pl: 20.1246 23.8286 28.4687 34.6014 43.3166 67.0754 94.2595 Inf

Table 14.

Octree reduction using the 3DTK toolkit, with voxel size “R” = 2 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Redandblack point cloud.

Redandblack, R = 2 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 55,189 106,830 156,077 202,877 246,949 289,386 330,106 369,273
Average size of .oct file, bytes: 886,801 1,713,077 2,501,036 3,249,837 3,954,998 4,633,979 5,285,505 5,912,168
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 8.9237 17.2383 25.1673 32.7023 39.7982 46.6306 53.1868 59.4928
Symmetric rmsF p2p: 1.5165 1.1475 0.9520 0.8189 0.7171 0.6323 0.5586 0.4924
Symmetric PSNR_1 p2p: 63.1605 64.3712 65.1824 65.8364 66.4132 66.9598 67.4984 68.0461
Symmetric PSNR_2 p2p: 4.4544 6.8758 8.4982 9.8061 10.9599 12.0530 13.1301 14.2255
Symmetric rmsF p2pl: 0.3515 0.3120 0.2826 0.2568 0.2330 0.2106 0.1894 0.1694
Symmetric PSNR_1 p2pl: 69.5091 70.0272 70.4576 70.8731 71.2956 71.7355 72.1946 72.6804
Symmetric PSNR_2 p2pl: 17.1515 18.1877 19.0486 19.8797 20.7245 21.6044 22.5225 23.4941

Table 15.

Octree reduction using the 3DTK toolkit, with voxel size “R” = 1 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Soldier point cloud.

Soldier, R = 1 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 298,525 554,593 753,226 920,279 1,011,060 1,058,928 1,060,749 1,060,770
Average size of .oct file, bytes: 4,781,816 8,878,917 12,057,034 14,729,884 16,182,391 16,948,270 16,977,416 16,977,739
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 48.1183 89.3465 121.3271 148.2234 162.8396 170.5465 170.8398 170.8430
Symmetric rmsF p2p: 0.7867 0.4851 0.2906 0.1324 0.0469 0.0017 0.0000 0.0000
Symmetric PSNR_1 p2p: 66.0108 68.1101 70.3362 73.7484 78.2606 92.5742 112.3807 Inf
Symmetric PSNR_2 p2p: 6.5947 10.7933 15.2455 22.0699 31.0944 59.7214 99.3344 Inf
Symmetric rmsF p2pl: 0.2672 0.1826 0.1110 0.0557 0.0199 0.0008 0.0000 0.0000
Symmetric PSNR_1 p2pl: 70.7010 72.3536 74.5152 77.5116 81.9802 95.8459 116.2297 Inf
Symmetric PSNR_2 p2pl: 15.9752 19.2803 23.6035 29.5964 38.5334 66.2649 107.0324 Inf

Table 16.

Octree reduction using the 3DTK toolkit, with voxel size “R” = 2 and different parameters for randomized octree-based point reduction with “O” points per voxel, for the Soldier point cloud.

Soldier, R = 2 O = 1 O = 2 O = 3 O = 4 O = 5 O = 6 O = 7 O = 8
Average number of output points: 78,169 151,269 221,122 287,916 350,702 411,306 469,731 526,048
Average size of .oct file, bytes: 1,256,069 2,425,706 3,543,359 4,612,076 5,616,648 6,586,314 7,521,122 8,422,194
Average number of input points: 795,010 795,010 795,010 795,010 795,010 795,010 795,010 795,010
Average bits per input point: 12.6395 24.4093 35.6560 46.4102 56.5190 66.2765 75.6833 84.7506
Symmetric rmsF p2p: 1.5565 1.1815 0.9842 0.8503 0.7489 0.6649 0.5922 0.5272
Symmetric PSNR_1 p2p: 63.0473 64.2443 65.0378 65.6730 66.2245 66.7409 67.2443 67.7490
Symmetric PSNR_2 p2p: 0.6678 3.0617 4.6487 5.9190 7.0221 8.0549 9.0618 10.0711
Symmetric rmsF p2pl: 0.3544 0.3203 0.2942 0.2705 0.2483 0.2271 0.2067 0.1871
Symmetric PSNR_1 p2pl: 69.4744 69.9128 70.2828 70.6468 71.0190 71.4072 71.8164 72.2488
Symmetric PSNR_2 p2pl: 13.5218 14.3986 15.1387 15.8666 16.6112 17.3875 18.2058 19.0707

5.7. Discussion

From Table 4, Table 5 and Table 6, i.e., before the Poisson surface reconstruction algorithm, the best projection is Miller 10-bit, although Miller 9-bit and Mercator (also 9-bit) perform similarly. Miller 10-bit is probably the best projection here because of its higher bit depth; however, the extra bit may not be justified by only slightly better objective measures (Figure 16). From Table 7, Table 8 and Table 9, i.e., after the Poisson reconstruction algorithm, the Mercator projection is the best for the projection areas of 1056 × 944 pixels and 1296 × 1160 pixels, while the cylindrical projection is the best for the projection area of 1496 × 1344 pixels. With a larger projection area more points are preserved, so the surface reconstruction algorithm gives better results. The main problem is in fact the automatic calculation of normal vectors: in some cases inverted normal vectors are computed, so the subsequent Poisson reconstruction creates unwanted artifacts, degrading the overall rmsF objective score. If the point clouds originate from sensor measurements, the sensor poses enable a consistent orientation of the normal vectors. Artifacts from inconsistent normals are noticeable in Figure 14 and Figure 15 as higher scores for those point clouds. In the best case, the cylindrical projection with a projection area of 1496 × 1344 pixels (Figure 15f), there are no unexpected errors, so the average score is the best in this case. The Miller projection also gives very good results, with only one larger error for the largest projection area. However, with smaller projection areas, some points of the original point cloud become occluded by other points and are therefore not visible in the used projection. Because of that, larger holes appear in the decompressed point cloud, which makes normal vector calculation and, ultimately, surface reconstruction difficult. This is especially visible for the smallest tested projection area, 1056 × 944 pixels. In this case, the average rmsF objective scores are the same before and after surface reconstruction, meaning that the reconstruction algorithm did not improve the results. For the Redandblack point cloud (Figure 18), all proposed projections generally give good results with Poisson reconstruction. For the Soldier point cloud (Figure 19), the Mercator and Miller projections give good results with Poisson reconstruction, while the cylindrical projection produces one error for the second point cloud. Newer projection maps might also be considered, for example those used for irregularly shaped objects in astronomy [52].
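
To illustrate the normal estimation and Poisson reconstruction step, the following minimal Python sketch uses the open-source Open3D library as a stand-in (the actual toolchain used in this paper may differ); the file names and the knn and depth parameters are illustrative assumptions.

    import open3d as o3d

    # Load a decompressed point cloud (file name is illustrative).
    pcd = o3d.io.read_point_cloud("longdress_decompressed.ply")

    # Estimate normals from local neighborhoods; inconsistently oriented
    # (inverted) normals at this stage cause the artifacts discussed above.
    pcd.estimate_normals(
        search_param=o3d.geometry.KDTreeSearchParamKNN(knn=30))

    # Propagate a consistent orientation over a tangent-plane graph;
    # this reduces, but does not always eliminate, inverted normals.
    pcd.orient_normals_consistent_tangent_plane(k=30)

    # Poisson surface reconstruction fills the holes left by occluded points.
    mesh, densities = o3d.geometry.TriangleMesh.create_from_point_cloud_poisson(
        pcd, depth=9)
    o3d.io.write_triangle_mesh("longdress_reconstructed.ply", mesh)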

In Equation (22) we use 16-bit precision, newly added to the 3DTK toolkit, so in this paper we have the best representation achievable with 16-bit precision; it can also be saved as a PNG file and stored using the 3DTK toolkit (which uses OpenCV to store images). However, the geometry is later compressed using 9-bit quantization (or 10-bit for the Miller projection) and FFV1 video compression. The FFV1 decoder also produces 16-bit PNG images, but with at most 2⁹ (or 2¹⁰) different intensities. This differs from the reference [9], where only 24-bit geometry precision was used before video compression. The closest comparable configuration in [9] used a 1920 × 1080 resolution with the equirectangular projection and required 64,160,627 bytes (for geometry and color), while in this paper, for a panorama size of 2,000,000 pixels and the equirectangular projection, we use 12.7261 bits per point, which (multiplied by the average of 795,010 input points and by 20 point clouds) corresponds to 25,293,426 bytes, or 39.4% of [9]. Separately, [9] uses 21,378,775 bytes for the geometry, while in this paper we use 14,225,893 bytes, or 66.5% of [9]. For the color, [9] uses 43,208,605 bytes, while in this paper we use 11,067,533 bytes, or 25.6% of [9]. Somewhat better pixel occupancy is also achieved in this paper compared to [9]: for the same case (equirectangular projection), the average decompressed Longdress point cloud has 428,221 points (before Poisson reconstruction) in this paper and 398,905 in [9]. This can also be compared with the equirectangular projection and a panorama size of 1,500,000 pixels, where the average decompressed point cloud has 390,058 points (before Poisson reconstruction); in this case we use 9.9778 bits per point, or 19,831,040 bytes, which is 30.9% of [9]. A better projection method such as Miller gives, for the same panorama size (1,500,000 pixels), a higher average number of output points (406,868), while needing 19,789,828 bytes for geometry and color, or 30.8% of [9]. In the reference [10], larger static point clouds with a bigger dynamic range were also used, which might need a 24-bit representation. However, in this paper we used only voxelized dynamic point clouds of size 1024 × 1024 × 1024, so a 9-bit representation might be enough.
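
To make the bit-depth handling concrete, the following minimal Python sketch (assuming NumPy and OpenCV; the file names and the rounding scheme are illustrative, not the exact encoder behavior) quantizes a 16-bit range panorama to 2⁹ levels while keeping the 16-bit PNG container, mirroring what the FFV1 round trip produces.

    import cv2
    import numpy as np

    BITS = 9  # 10 would be used for the Miller projection

    # Read a 16-bit range panorama unchanged (file name is illustrative).
    depth16 = cv2.imread("range_panorama.png", cv2.IMREAD_UNCHANGED)
    depth16 = depth16.astype(np.float64)

    # Quantize to 2**BITS levels, then re-expand into the 16-bit container:
    # the result is a 16-bit image with at most 2**BITS distinct intensities.
    levels = (1 << BITS) - 1
    q = np.round(depth16 / 65535.0 * levels)
    depth16_q = (q / levels * 65535.0).astype(np.uint16)

    cv2.imwrite("range_panorama_9bit.png", depth16_q)
    print("distinct intensities:", np.unique(depth16_q).size)  # <= 2**BITS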

Compared with the octree reduction results, the proposed solution performs better at a similar size. However, the octree reduction algorithm was not designed for dynamic point clouds but for other types, such as LIDAR point clouds with non-uniform density and sampling and much higher bit depth, for which the results might differ. Also, unlike in the proposed solution, points in octree-based reduction do not occlude each other, so it creates a uniformly (sub)sampled point cloud regardless of the final number of points.

6. Conclusions

In this paper, we have proposed a new projection-based point cloud compression using different projection types, video compression algorithms and a surface reconstruction algorithm. Ten different projection types and three different projection area sizes have been considered, and objective point-to-point and point-to-plane measures were calculated. The results showed that, overall, the Miller projection can be considered the best among the tested projections. The Mercator projection needs to be modified to address the problem of representing latitudes φ near ±90°. The cylindrical projection has worse objective scores for the smaller panorama sizes (among those tested). Also, although Poisson surface reconstruction can introduce some artifacts due to missing points in the raw decompressed point cloud, it is an important step of the proposed point cloud compression: it fills in the missing points and usually improves the visual quality of the reconstructed point clouds, at least for the larger panorama sizes and the tested point clouds.

Future research will consider different projection types that keep points without creating larger holes, as well as better algorithms for normal vector calculation, which together may provide higher compression ratios. Different empty-pixel color-filling methods might also be considered, such as the horizontal filling used by MPEG’s V-PCC compression.

Acknowledgments

This publication was supported by the Open Access Publication Fund of the University of Würzburg.

Author Contributions

Conceptualization, E.D.; methodology, E.D. and A.B.; software, E.D. and A.N.; validation, E.D., A.B. and A.N.; formal analysis, E.D.; investigation, E.D. and A.B.; resources, A.N.; data curation, E.D.; writing—original draft preparation, E.D.; writing—review and editing, A.B. and A.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are openly available at http://msl.unin.hr/ (accessed on 13 September 2021).

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Astola P., da Silva Cruz L.A., da Silva E.A., Ebrahimi T., Freitas P.G., Gilles A., Oh K.J., Pagliari C., Pereira F., Perra C., et al. JPEG Pleno: Standardizing a Coding Framework and Tools for Plenoptic Imaging Modalities. ITU J. ICT Discov. 2020;3:1–15. doi: 10.1002/pub/8153d79a-en.
2. Perkis A., Timmerer C., Baraković S., Husić J.B., Bech S., Bosse S., Botev J., Brunnström K., Cruz L., Moor K.D., et al. QUALINET White Paper on Definitions of Immersive Media Experience (IMEx); Proceedings of the European Network on Quality of Experience in Multimedia Systems and Services, 14th QUALINET Meeting; Online, 25 May 2020; pp. 1–15.
3. Wang Q., Tan Y., Mei Z. Computational Methods of Acquisition and Processing of 3D Point Cloud Data for Construction Applications. Arch. Comput. Methods Eng. 2020;27:479–499. doi: 10.1007/s11831-019-09320-4.
4. Pereira F., da Silva E.A., Lafruit G. Chapter 2—Plenoptic imaging: Representation and processing. In: Chellappa R., Theodoridis S., editors. Academic Press Library in Signal Processing. Volume 6. Elsevier; Amsterdam, The Netherlands: 2018; pp. 75–111.
5. van der Hooft J., Vega M.T., Timmerer C., Begen A.C., De Turck F., Schatz R. Objective and Subjective QoE Evaluation for Adaptive Point Cloud Streaming; Proceedings of the 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX); Athlone, Ireland, 26–28 May 2020; pp. 1–6.
6. Han B., Liu Y., Qian F. ViVo: Visibility-aware mobile volumetric video streaming; Proceedings of the 26th Annual International Conference on Mobile Computing and Networking, MobiCom 2020; London, UK, 21–25 September 2020; pp. 137–149.
7. Dumic E., Battisti F., Carli M., da Silva Cruz L.A. Point Cloud Visualization Methods: A Study on Subjective Preferences; Proceedings of the 2020 28th European Signal Processing Conference (EUSIPCO); Amsterdam, The Netherlands, 18–21 January 2020; pp. 1–5.
8. Javaheri A., Brites C., Pereira F.M.B., Ascenso J.M. Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality. IEEE Trans. Multimed. 2020;23:4049–4064. doi: 10.1109/TMM.2020.3037481.
9. Dumic E., Bjelopera A., Nüchter A. Projection based dynamic point cloud compression using 3DTK toolkit and H.265/HEVC; Proceedings of the 2019 2nd International Colloquium on Smart Grid Metrology (SMAGRIMET); Split, Croatia, 9–12 April 2019; pp. 1–4.
10. da Silva Cruz L.A., Dumić E., Alexiou E., Prazeres J., Duarte R., Pereira M., Pinheiro A., Ebrahimi T. Point cloud quality evaluation: Towards a definition for test conditions; Proceedings of the 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX); Berlin, Germany, 5–7 June 2019; pp. 1–6.
11. Elseberg J., Borrmann D., Nüchter A. One billion points in the cloud—An octree for efficient processing of 3D laser scans. ISPRS J. Photogramm. Remote Sens. 2013;76:76–88. doi: 10.1016/j.isprsjprs.2012.10.004.
12. Elseberg J., Magnenat S., Siegwart R., Nüchter A. Comparison of nearest-neighbor-search strategies and implementations for efficient shape registration. J. Softw. Eng. Robot. 2013;3:2–12.
13. 3DTK—The 3D Toolkit. Available online: http://slam6d.sourceforge.net/ (accessed on 13 September 2021).
14. Houshiar H., Borrmann D., Elseberg J., Nüchter A. Panorama based point cloud reduction and registration; Proceedings of the 2013 16th International Conference on Advanced Robotics (ICAR); Montevideo, Uruguay, 25–29 November 2013; pp. 1–8.
15. Houshiar H., Nüchter A. 3D point cloud compression using conventional image compression for efficient data transmission; Proceedings of the 2015 XXV International Conference on Information, Communication and Automation Technologies (ICAT); Sarajevo, Bosnia and Herzegovina, 29–31 October 2015; pp. 1–8.
16. Mammou K., Chou P.A., Flynn D., Krivokuća M., Nakagami O., Sugio T. G-PCC Codec Description v2; Technical Report, ISO/IEC JTC1/SC29/WG11 Input Document N18189; MPEG: Marrakech, Morocco, 2019.
17. Zakharchenko V. V-PCC Codec Description; Technical Report, ISO/IEC JTC1/SC29/WG11 Input Document N18190; MPEG: Marrakech, Morocco, 2019.
18. Sullivan G.J., Ohm J., Han W., Wiegand T. Overview of the High Efficiency Video Coding (HEVC) Standard. IEEE Trans. Circuits Syst. Video Technol. 2012;22:1649–1668. doi: 10.1109/TCSVT.2012.2221191.
19. Schwarz S., Preda M., Baroncini V., Budagavi M., Cesar P., Chou P.A., Cohen R.A., Krivokuća M., Lasserre S., Li Z., et al. Emerging MPEG Standards for Point Cloud Compression. IEEE J. Emerg. Sel. Top. Circuits Syst. 2019;9:133–148. doi: 10.1109/JETCAS.2018.2885981.
20. Graziosi D., Nakagami O., Kuma S., Zaghetto A., Suzuki T., Tabatabai A. An overview of ongoing point cloud compression standardization activities: Video-based (V-PCC) and geometry-based (G-PCC). APSIPA Trans. Signal Inf. Process. 2020;9:e13. doi: 10.1017/ATSIP.2020.12.
21. Alexiou E., Viola I., Borges T.M., Fonseca T.A., de Queiroz R.L., Ebrahimi T. A comprehensive study of the rate-distortion performance in MPEG point cloud compression. APSIPA Trans. Signal Inf. Process. 2019;8:27. doi: 10.1017/ATSIP.2019.20. Available online: https://www.epfl.ch/labs/mmspg/quality-assessment-for-point-cloud-compression/ (accessed on 13 September 2021).
22. Perry S., Cong H.P., da Silva Cruz L.A., Prazeres J., Pereira M., Pinheiro A., Dumic E., Alexiou E., Ebrahimi T. Quality Evaluation of Static Point Clouds Encoded Using MPEG Codecs; Proceedings of the 2020 IEEE International Conference on Image Processing (ICIP); Abu Dhabi, United Arab Emirates, 25–28 October 2020; pp. 3428–3432.
23. Alexiou E., Tung K., Ebrahimi T. Towards neural network approaches for point cloud compression. In: Applications of Digital Image Processing XLIII, Proceedings Volume 11510; International Society for Optics and Photonics: Bellingham, WA, USA, 2020; p. 1151008.
24. Mekuria R., Blom K., Cesar P. Design, Implementation, and Evaluation of a Point Cloud Codec for Tele-Immersive Video. IEEE Trans. Circuits Syst. Video Technol. 2017;27:828–842. doi: 10.1109/TCSVT.2016.2543039.
25. Quach M., Valenzise G., Dufaux F. Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression; Proceedings of the 2019 IEEE International Conference on Image Processing, ICIP 2019; Taipei, Taiwan, 22–25 September 2019; pp. 4320–4324.
26. Loop C., Cai Q., Escolano S.O., Chou P. Microsoft Voxelized Upper Bodies—A Voxelized Point Cloud Dataset; Technical Report, ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) Input Document m38673/M72012; 2016. Available online: http://plenodb.jpeg.org/pc/microsoft/ (accessed on 13 September 2021).
27. Quach M., Valenzise G., Dufaux F. Improved Deep Point Cloud Geometry Compression; Proceedings of the 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP); Tampere, Finland, 21–24 September 2020; pp. 1–6.
28. Wang J., Zhu H., Liu H., Ma Z. Lossy Point Cloud Geometry Compression via End-to-End Learning. IEEE Trans. Circuits Syst. Video Technol. 2021;31:4909–4923. doi: 10.1109/TCSVT.2021.3051377.
29. Wang J., Ding D., Li Z., Ma Z. Multiscale Point Cloud Geometry Compression; Proceedings of the 2021 Data Compression Conference (DCC); Virtual, 23–26 March 2021; pp. 73–82.
30. Guarda A.F.R., Rodrigues N.M.M., Pereira F. Point Cloud Coding: Adopting a Deep Learning-based Approach; Proceedings of the 2019 Picture Coding Symposium (PCS); Ningbo, China, 12–15 November 2019; pp. 1–5.
31. Rusu R.B., Cousins S. 3D is here: Point Cloud Library (PCL); Proceedings of the IEEE International Conference on Robotics and Automation (ICRA); Shanghai, China, 9–13 May 2011.
32. Guarda A.F.R., Rodrigues N.M.M., Pereira F. Deep Learning-based Point Cloud Geometry Coding with Resolution Scalability; Proceedings of the 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP); Tampere, Finland, 21–24 September 2020; pp. 1–6.
33. Guarda A.F.R., Rodrigues N.M.M., Pereira F. Adaptive Deep Learning-Based Point Cloud Geometry Coding. IEEE J. Sel. Top. Signal Process. 2021;15:415–430. doi: 10.1109/JSTSP.2020.3047520.
34. Milani S. ADAE: Adversarial Distributed Source Autoencoder For Point Cloud Compression; Proceedings of the 2021 IEEE International Conference on Image Processing (ICIP); Anchorage, AK, USA, 19–22 September 2021; pp. 3078–3082.
35. Lazzarotto D., Alexiou E., Ebrahimi T. On Block Prediction For Learning-Based Point Cloud Compression; Proceedings of the 2021 IEEE International Conference on Image Processing (ICIP); Anchorage, AK, USA, 19–22 September 2021; pp. 3378–3382.
36. Yan W., Shao Y., Liu S., Li T.H., Li Z., Li G. Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds. arXiv. 2019. Available online: https://arxiv.org/abs/1905.03691 (accessed on 13 September 2021).
37. Huang T., Liu Y. 3D Point Cloud Geometry Compression on Deep Learning; Proceedings of the 27th ACM International Conference on Multimedia; Nice, France, 21–25 October 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 890–898.
38. Nguyen D.T., Quach M., Valenzise G., Duhamel P. Learning-Based Lossless Compression of 3D Point Cloud Geometry; Proceedings of the ICASSP 2021—2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); Toronto, ON, Canada, 6–11 June 2021; pp. 4220–4224.
39. Nguyen D.T., Quach M., Valenzise G., Duhamel P. Multiscale deep context modeling for lossless point cloud geometry compression; Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW); Shenzhen, China, 5–9 July 2021.
40. Que Z., Lu G., Xu D. VoxelContext-Net: An Octree based Framework for Point Cloud Compression; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021; Virtual, 19–25 June 2021; pp. 6042–6051.
41. Lazzarotto D., Ebrahimi T. Learning residual coding for point clouds. In: Tescher A.G., Ebrahimi T., editors. Applications of Digital Image Processing XLIV. Volume 11842. International Society for Optics and Photonics: Washington, DC, USA, 2021; pp. 223–235.
42. Weisstein E.W. Map Projection. From MathWorld—A Wolfram Web Resource. Available online: https://mathworld.wolfram.com/topics/MapProjections.html (accessed on 16 April 2021).
43. Houshiar H. Documentation and Mapping with 3D Point Cloud Processing. Ph.D. Thesis, University of Würzburg, Würzburg, Germany, 2017.
44. d’Eon E., Harrison B., Myers T., Chou P.A. 8i Voxelized Full Bodies—A Voxelized Point Cloud Dataset; Technical Report, ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) Input Document WG11M40059/WG1M74006; 2017. Available online: https://jpeg.org/plenodb/pc/8ilabs/ (accessed on 13 September 2021).
45. Visual Computing Lab. MeshLab. Available online: http://www.meshlab.net/ (accessed on 13 September 2021).
46. CloudCompare—3D Point Cloud and Mesh Processing Software—Open Source Project. Available online: http://www.cloudcompare.org (accessed on 6 February 2019).
47. FFmpeg Team. FFmpeg. Available online: https://www.ffmpeg.org/download.html (accessed on 2 May 2021).
48. Weisstein E.W. Voronoi Diagram. From MathWorld—A Wolfram Web Resource. Available online: https://mathworld.wolfram.com/VoronoiDiagram.html (accessed on 6 November 2021).
49. Dumic E. Scripts for Dynamic Point Cloud Compression. Available online: http://msl.unin.hr/ (accessed on 8 November 2021).
50. Dumic E., da Silva Cruz L.A. Point Cloud Coding Solutions, Subjective Assessment and Objective Measures: A Case Study. Symmetry. 2020;12:1955. doi: 10.3390/sym12121955.
51. Tian D., Ochimizu H., Feng C., Cohen R., Vetro A. Geometric distortion metrics for point cloud compression; Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP); Beijing, China, 17–20 September 2017; pp. 3460–3464.
52. Grieger B. Quincuncial adaptive closed Kohonen (QuACK) map for the irregularly shaped comet 67P/Churyumov-Gerasimenko. Astron. Astrophys. 2019;630:A1. doi: 10.1051/0004-6361/201834841.
