As I understand, img1 and img2 are matrices containing all the pixels of two images.
Suppose:
img1 = x x x
x x x
x x x
img2 = o o
o o
o o
o o
Output is a matrix whose height is the highest between img1 and img2 and whose width is the sum of both widths. I don’t know if depth is relevant, but it uses the depth (z-axis) of the first image. Then output would be:
output = 0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
The first step is to save img1 on output. This is done occupying the indices from 0 to img1.height on the y axis and from 0 to img1.width on the y axis.
output[:img1.shape[0], :img1.shape[1],:] = img1
output = x x x 0 0
x x x 0 0
x x x 0 0
0 0 0 0 0
Next, img2 is saved from 0 to img2.height on the y axis and from img1.width to img1.width+img2.width on the x axis.
Then:
output[:img2.shape[0]:,img1.shape[1]:img1.shape[1]+img2.shape[1],:] = img2
output = x x x o o
x x x o o
x x x o o
0 0 0 o o
I suppose that this would be done in the z axis as well if both images have information on this axis.
2
solved How can we interpret this Python code? [closed]