In this tutorial, you will read the global examination file and generate four files, one for each skill. Each file will contain the top 10 scores for that skill. The scores will not be the original values; they will be converted to a scale from 0 to 5.
As you should already be quite comfortable with PDI, some explanations in this section omit the full details. Instead, the explanation focuses on the structure of the jobs and transformations.
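Create a new transformation and save it in the transformations folder under the name top_scores.ktr.
[...] file_processed and process_date.
[...] name and last name.
Convert name and last name to uppercase.
Replace the fields writing, reading, speaking, and listening with the same value divided by 20 (for example, [writing]/20). You have already done this in Chapter 3.
Sort the rows in descending order by the writing field.
Keep only the first 10 rows by using the following JavaScript code:
    // Let the first 10 rows through and skip every row after them.
    trans_Status = CONTINUE_TRANSFORMATION;
    if (getProcessCount('r') > 10) trans_Status = SKIP_TRANSFORMATION;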
Add a sequence named seq_w. Leave the defaults so that the field contains the values 1, 2, 3, and so on.
Rename the sequence seq_w as position and the field writing as score. Specify this change in the Select & Alter tab, and check the option Include unspecified fields, ordered.
Send the data to the writing_top10.txt file at the location specified by the ${LABSOUTPUT} variable. In the Fields tab, put the following fields: position, student_code, student_name, student_lastname, and score.
Repeat the procedure for the reading field: rename the sequence seq_r as position and the field reading as score, and send the data to the reading_top10.txt file.
Do the same for the speaking field and the listening field.
Run the transformation and check the writing_top10.txt file (the names and values may vary depending on the examination files that you have appended to the global file).
You read the big file with examination results and generated four files with information about the top scores, one file for each skill.
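The overall logic of the transformation can also be sketched outside PDI in plain JavaScript. This is a minimal sketch, not the transformation itself: the function names topScores and topScoresBySkill, the in-memory sample rows, and the mapping of name and last name to student_name and student_lastname are assumptions made for illustration, and the input scores are assumed to be on a 0-100 scale.

```javascript
// Illustrative sketch only; none of these names come from PDI.
const SKILLS = ["writing", "reading", "speaking", "listening"];

// Return the top `limit` rows for one skill, each with a 1-based position
// and the score converted from the assumed 0-100 scale to the 0-5 scale.
function topScores(rows, skill, limit = 10) {
  return rows
    .map((row) => ({ ...row, score: row[skill] / 20 })) // rescale the score
    .sort((a, b) => b.score - a.score) // highest score first
    .slice(0, limit) // keep only the first `limit` rows
    .map((row, index) => ({
      position: index + 1, // plays the role of the sequence (1, 2, 3, ...)
      student_code: row.student_code,
      student_name: row.name.toUpperCase(),
      student_lastname: row["last name"].toUpperCase(),
      score: row.score,
    }));
}

// Build one result list per skill, mirroring the four output files.
function topScoresBySkill(rows, limit = 10) {
  const result = {};
  for (const skill of SKILLS) {
    result[skill] = topScores(rows, skill, limit);
  }
  return result;
}

// Hypothetical sample rows standing in for the global examination file.
const sampleRows = [
  { student_code: "A1", name: "ana", "last name": "lopez", writing: 80, reading: 60, speaking: 100, listening: 40 },
  { student_code: "B2", name: "ben", "last name": "kim", writing: 90, reading: 70, speaking: 50, listening: 100 },
];

const top = topScoresBySkill(sampleRows);
console.log(top.writing);
```

Each skill is processed independently here, just as each branch of the transformation sorts, trims, and numbers its own copy of the rows. That is why every list starts again at position 1.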
Apart from using the Add sequence step for the first time, there was nothing new. However, there are several improvements you can make to this transformation. The next tutorials are meant to teach you some tricks.
In the previous tutorial, you used different names for the sequences and then you renamed all of them to position. Which of the following options gives you the same results you got in the tutorial?
a. Using position as the name of the sequence in all Add sequence steps
b. Joining the four streams with a single Add sequence step and then splitting the stream back into four streams by using the Distribute method you learned in Chapter 4
c. Joining the four streams with a single Add sequence step and then splitting the stream back into four streams by using a Switch case step that distributes the rows properly
d. All of them
e. None of them