Answers to Selected Problems and Questions

Chapter 1 Introduction to Image Understanding

1-5{Points}: An image is obtained by perceiving the objective world with an observing system, which produces visual perception of the entity; a graphic is an image generated from a non-image form of data description and is a visual representation of known data.

There are many uncertainties in image understanding and computer vision. Computer graphics is used mostly to treat a deterministic problem, which can be solved by mathematical methods.

1-9{Hint} In general, most high-level tasks have this feature, one of the reasons is that it is determined by the characteristics of the current computer system, Von Neumann machine.

Chapter 2 Stereo Vision

2-1According to the formula D = A · B/(A − Z), for point (1m, 0m, 2m), there is D = 0.0010256m; for point (2m, 0m, 3m), there is D = 0.00067796m. When D = 0.002 m, there is Z = 0.95 m.

2-9Here, the measurement accuracy of two binocular systems with the same baseline length is compared. According to eq. (2.16), the parallax is inversely proportional to the distance, and the distance between the point P and the camera R is closer than the distance between the point P and the camera T, so that the stereoscopic image composed of the left image L and the right image R can produce more accurate results when measuring the depth value of the point P.

Chapter 3 Image Enhancement

3-5The normal vector [p, q, s]T of a point (x, y) on the surface of the sphere is

p=zx=xr2x2y2=xzq=zy=yr2x2+y2=yz

The light source vector is [a, b, s]T. For the complete scattering surface, eq. (3.26) can be used, that is

E(x,y)=R(p,q)=1+psp+qsq1+p2+q21+ps2+qs2=1+ax/z+by/z1+(x/z)2+(y/z)21+a2+b2=1+ax/z+by/zr21+a2+b2=1+ax/z+by/zr1+a2+b2

3-9The orientation of the object plane is determined by two angles, where the ratio of the major axis length over the minor axis length gives the inclination angle of the plane normal to the camera axis, and it is arccos(3/4) = 41.41° here; and the angle between the major axis and the X-axis 135° gives the rotation angle of the plane normal to the camera axis. In terms of symmetry, there are two possible orientations of the object plane, namely (inclination angle, rotation angle) = (41.41°, 135°) and (inclination angle, rotation angle) = (−41.41°, 135°).

Chapter 4 Image Restoration

4-2Sufficiency can be directly obtained from eq. (4.11). Necessity can be divided into two cases to discuss.

(1)The length of A is greater than the length of B: If the length of A is N, then there must be N > M (the length of B is greater than or equal to M at this time), that is Q > 0. Similarly, the case that the length of B is greater than the length of A can be proved.

(2)The length of A is equal to the length of B:If the length of A is N, and N = M (i.e., A and B are not exactly the same string), then N > M, and Q > 0.
As discussed above, Q can only be zero if A and B are exactly the same string.

4-7Use the method of counter-evidence. Suppose the two graphs are isomorphic, then the parallel edges a and c should correspond to the parallel edges x and z (the only parallel edges), so the vertices A and B should correspond to vertices X and Y, but both A and B are the endpoints of the three edges, and X and Y are not all the endpoints of three edges, contradiction.

Chapter 5 Scene Analysis and Semantic Interpretation

5-2g = 170.

5-11{Hint} Refer to Figure 5.1.

Chapter 6 Multi-Sensor Image Fusion

6-8(1)  Since XYX,XYY,so R*(X),R*(XY)R*(Y)

(2)Since XXY,XY,soR*(XY),R*(Y)R*(XY)

6-12Assume X is R-rough and R-definable, then R*(X) ≠ , R*(X) ≠ L.
R*(X)xX,so[x]RX[x]RX¯=R*(X¯)L
R*(X)LyL,so[y]RR*(X)=0[y]RR*(X¯)R*(X¯)0.

Chapter 7 Content-Based Image Retrieval

7-1The match value between Figure Problem 7-1(a, b) is 7/16. The match value between Figure Problem 7-1(b, c) is 9/16.

7-6For example, the ratio of height and width of Feret box (see Section 3.3.2 in Volume II) as well as form factor (see Section 6.3.1 in Volume II). The appearance of the trousers in the three images is mainly different in the bifurcation of trousers legs. Thus, their ratios of height and width of Feret box would be significantly different; while their shape factors would be quite close (the contour length and the silhouette area are similar).

Chapter 8 Color Image Processing

8-3{Hint} One possible approach is to resample (or interpolate) the raw data, so as to transform the anisotropic data into isotropic data.

8-12The first line: Define a testing procedure for a car in the parking lot parade, in which the car is represented by v and the parking lot is represented by lot;

The second line: The detection begins when the car drives into the parking lot;

The third line: The counter i is set to 0;

The fourth line: A cycle of statistical count starts.

The fifth line: If the car is in the parking lot and the car turns in the parking lot on the road for one circle, the counter will increase by 1;

The sixth line: If the counter reaches a predetermined threshold n, the counting is stopped;

The seventh line: The car leaves the parking lot, the detection is ended and the program exits.

Consider a basketball player practicing shooting activities, as shown in Figure Solution 8-12. The player is represented by p, basketball court is represented by court. The detection starts from the moment the player enters the basketball court and makes a basketball shot. Making a basketball shot (making a ball into the basket), the counter plus 1 to i. If a predetermined threshold n is reached, the counting is stopped, the training is ended and the player can leave the court.

Figure Solution 8-12

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.14.144.229