Extrinsic Transform:
For p=[x,y,z] we have on the camera coordinates:
Camera intrinsic parameters:
If pc=Rp+t=:[xc,yc,zc]T then the point in the image plane is:
Suppose ptf and pbf are the top and bottom points for the first row object.
The distance on the y coordinate in the image plane is given by:
Case 1: first row
Suppose pf and pb are the front and back points for the last row object.
The distance on the y coordinate in the image plane is given by:
Case 2: last row (no inclination)