4.1.6 Example of Faceted Annotation

34 4. FACETED FRAMEWORK OF INTERACTIVE IR USER STUDIES

4.1.5 USER BEHAVIOR, EXPERIENCE, DATA ANALYSIS, AND RESULTS

To capture and evaluate the products of user studies, researchers have developed a variety of user

behavioral, performance, and experience measures, including both online measures collected within

search sessions and oine measures consisting of user annotations, document relevance and use-

fulness judgments, and other self-reported metrics (e.g., NASA Task Load Index, user engagement

indicators). To extract useful insights and actionable implications from the empirical evidence

emerged from user studies, researchers also have to feed raw data into mathematical models and

wait for the results on the output side. As is illustrated in Figures 4.1 and 4.2, the adoption of

behavioral measures and analytical models is often determined based upon the research problems

identied as well as the variables and associated main facets dened in user study design.

e metric of user behaviors and interaction experiences (e.g., satisfaction, cognitive load,

frustration) is a fundamental aspect of the user studies that aim at understanding users and applying

user-centered approach in search systems design and evaluation. With respect to user behaviors,

past user studies have explored various possible approaches to measuring multiple components of

information seeking and search episodes, such as querying (e.g., query formulation and re-formula-

tion, use of query suggestion, query diversity and complexity), browsing (e.g., dwell time on SERP

and landing/content pages), and result examination behavior (e.g., clicks, scrolls, mouse hovering).

In addition, researchers also employed oine user judgments (e.g., usefulness and relevance judg-

ment), annotations, and performance measures (e.g., average precision, normalized discounted

cumulative gain in search result examination) and evaluated the usefulness of these metrics in IR

system evaluation against a series of user-centered ground truth measures (e.g., satisfaction). Chen

et al. (2017) provides a summary of existing online and oine behavioral metrics and compared the

usefulness and eectiveness of these distinct measures in a user-centered system evaluation. Aside

from the explicit search behaviors, performances, and user annotations, some researchers have also

explored the possibility of measuring users’ cognitive and aective variations and examined the ex-

tent to which neuro-physiological measures of dierent types (e.g., eye movement, pupil size, heart

rate, skin conductance rate, EEG and fMRI signals) are associated with traditional behavioral and

experience measures (e.g., Cole et al., 2013; Edwards and Kelly, 2017; Eickho, Dungs, and Tran,

2015; Eugster et al., 2014; Gwizdka and Zhang, 2015; Moshfeghi, Triantallou, and Pollick, 2016).

In IIR user studies, the behavioral and interaction experience measures discussed above have served

as useful sensors for detecting the features and behavioral patterns of users’ search interactions.

Combining these “behavioral pixels” in modeling, we can obtain a relatively comprehensive picture

about how users’ interaction with search systems in tasks of dierent types and how they assess the

eectiveness of the system as well as their own search experience in dierent contexts.

In addition to data collection and measures, data analyses and result presentation methods

are also determined by the research questions proposed at the beginning of many user studies.

Given the nature of the research problems and goals (e.g., examining the correlation between

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 4.1.6 Example of Faceted Annotation