References

140 9. CODE OPTIMIZATION

L9 LAB 9:

CODE OPTIMIZATION

e purpose of this lab is to experiment with the optimization steps discussed above. ese steps

include changing compiler settings, writing eﬃcient code constructs, and using architecture-

speciﬁc instructions for the ARM processor. e FIR ﬁltering (linear convolution) example is

considered as a model case to show the eﬀects of these steps on the real-time throughput.

Consider a lowpass ﬁlter whose passband covers the human vocal frequency range. e

speciﬁcation used to generate the ﬁlter in MATLAB is as follows.

rpass = 0.1; %passband ripple

rstop = 20; %stopbad ripple

fs = 48000; %sampling frequency

f = [3000 3570]; %frequency bands

a = [1 0]; %desired amplitudes

dev = [(10^(rpass/20)-1)/(10^(rpass/20)+1) 10^(-rstop/20)]; %deviations

[n,fo,ao,w] = firpmord(f,a,dev,fs); %estimate

B = firpm(n,fo,ao,w); %compute coefficients

e above MATLAB code produces the coeﬃcient array B containing 128 coeﬃcients. e

shell for this lab provides the timing and linear convolution functions.

L9.1 COMPILER OPTIONS

Using the ﬁlter speciﬁed above and a sampling rate of 48 kHz, run the ﬁlter by enabling diﬀerent

optimization levels and report the processing times achieved. Use the recording function and

wait until the reported frame time stabilizes, or use a suﬃciently long test signal ( 20 s) and

record the processing time for the signal.

L9.2 TARGET ARCHITECTURE (ANDROID ONLY)

Using a target smartphone which supports armeabi-v7a, enable the abiFilter for armeabi-v7a

in the ndk section of the build.gradle ﬁle and enable the hardware ﬂoating-point by setting the

cﬂags to -mfloat-abi=softfp -mfpu=neon-vfpv4 -O3 and re-run the experiment. Com-

pare the processing time obtained with armeabi vs. armeabi-v7a.

L9.3 CODE MODIFICATION

Implement the linear convolution algorithm using the discussed pointer manipulation technique

and report the processing time. Compare the processing time when using ﬂoating-point values

for the ﬁlter coeﬃcients with the processing time when using double precision format for the

ﬁlter coeﬃcients. is can be done by changing the data type of the coeﬃcient storage array.

For these experiments, use the O3 compiler optimization setting.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for References