Index

images  A

Application development tools, Xeon Phi

    APIs

    asynchronous data transfer over PCI express

    CAPS compiler

    categories

    debugging applications

    IDB

    Intel C/C++ Composer XE

    Intel Cluster tools

        Bright Cluster Manager

        PBS Professional

    Intel Fortran composer XE

        APIs

        directives

        macros

    Intel Vtune Amplifier XE

    intrinsics

        C++ class libraries

    keywords

        _Cilk_offload and _Cilk_shared

        rules

        using shared virtual memory

    libraries

        automatic offload version

        compiler-assisted offload

        native/symmetric execution

    macros

        _INTEL_OFFLOAD macro

        _KNC_ macro

        _MIC_ macro

    OpenMP 4.0 extensions

    pragmas

        offload_attribute

        offload_transfer and offload_wait

        pragma offload

    third-party compilers

    third-party debuggers

        DDT

        GNU debugger

        TotalView

    third-party math libraries

        ArrayFire

        Magma MIC

images  B

images  C

images  D

Development tools, Xeon Phi

    language extensions

        compiling and running

        host and coprocessor

        IDE

        link time

        nonshared memory program

        virtual shared memory

    offload environment variables

images  E, F

images  G

images  H

images  I, J

images  K

images  L

images  M

images  N

images  O

Optimizing code

    compiler-driven optimizations

        code restructuring techniques

        prefetching

    data alignment

    Intel Cilk Plus array notation

        Intel Compiler

        OpenMP/Cilk Plus/TBB

    large pages

        cache blocking

        loop fusion/fission

        loop interchange

        loop peeling

        loop unrolling

        page size

        THP

        unroll and jam

    removing pointer aliasing

    steaming store

    vectorization

        compiler report

        vectorizing code

images  P, Q

images  R

Ring 0 driver layer components. See also MIC Platform Software Stack (MPSS)

    application and system functionalities

    coprocessor OS

    Linux virtual file system (Sysfs and Procfs)

        current configuration

        /proc virtual file system

        SCIF layer

        Xeon Phi Coprocessor Sysfs virtual file system /sys

    mic0

    MPSS stack during runtime

    network stacks

    NFS

    OFED and MPI

    system boot process

    system software application components

        miccheck

        micctrl

        micflash

        micinfo

        micnativeloadex

        micrasd

        micsmc

    third-party coprocessor OS

        ‘dmesg’ output

        GDT

        POST code

images  S

System software components See also Ring 0 driver layer components

    application components

        miccheck

        micctrl

        micflash

        micinfo

        micnativeloadex

        micrasd

        micsmc

    application layer

    Xeon Phi software layers

images  T

images  U

images  V

Vector ISA

    arithmetic and logic operation

        carry propagate instructions

        fused multiply-add

    categories

        conversions, data

        data broadcasts

        mask operations

        register memory swizzle

        shuffles

        swizzle commandoperation

    code, swizzle and shuffle

    data access operations

        memory alignment

        non-temporal data

        pack/unpack

        prefetch instructions

        scatter/gather

        streaming stores

    data types

    operand performance

    shift operation

        arithmetic

        logical shift

    type conversion, memory load

    vector instruction syntax

    vector nomenclature

images  W

Windows OS application. See also Development tools, Xeon Phi

    CAD

    debugging offload execution

        PuTTY

        steps

    MPSS installation

        micctrl

        MicFlash

        MicInfo tool

        MicRas

        MicSmc

        package

        SDK (Binutils)

        system components

    native programs build applications

    VTune Amplifier XE tool

images  X, Y, Z

Xeon Phi

    code generation

        execution mode

        MIC architecture

        offload syntax

    development tools

        Intel Composer XE

        software

    execution models

    installation

        development tools

        MPSS stack

    language extensions

        declare functions and variables

        execution constructs

        heterogeneous computing model

        modification

        Offload Pragmas

        OFFLOAD_REPORT

        reduce function

        restrictions

        runtime library

        target update

        terminology

    software performance

    tools (see Application development tools, Xeon Phi)

Xeon Phi coprocessors

    COI and SCIF

    components

        APIC logic

        PCB

        SIU and TCU

        SMBus

        SMC

        SPI

    DMA engine

    DMA transfer

    PCIe slots

    system configuration

Xeon Phi core architecture

    cache

    computing

    lmbench benchmark

    L2 cache

    pipeline stages

        ALUs

        D0 stage

        D1 and D2 stage

        global stall pipeline architecture

        instruction fetch

        picker function

        pre-thread picker function

        TLB

        4-way multithreaded

        WB stage

    time-multiplexed multithreading

        AGI

        Gflops operations

        pairing rules

        prefix decode

    TLB

        linear address

        page table data structures

        paging mechanism

        transparent huge page

    turbo modes

    vector units

    2-wide processor

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.216.239.46