Intel GPU design
  • Intel processor graphics GPUs have two main groups of hardware:

    image

    • Execution Units (EUs): General purpose execution units. These are used for graphics rendering, but they are also suited to a wide range of media processing tasks.
    • Media Fixed Function: In addition, specialized fixed function hardware accelerates video codec and frame processing algorithms for fundamentally higher performance at lower power than the EUs or CPUs.

    The basic unit of the GPU architecture is the subslice, which contains EUs and memory components.

    image

    Subslices are assembled into slices

    image

    Execution Units are general purpose compute engines that are well suited to video processing. Encoders often use them for the parts of the algorithm that do not run in fixed function hardware, such as rate control and mode decisions. The samplers are also important for media tasks: resize and motion estimation both use them.

    In addition to EU slices, there is an “unslice” with additional hardware engines individually schedulable for media tasks:

    image

    image

    1. VDBox (also known as MFX), for codec operations.
    2. VEBox (also known as VQE), for hardware-accelerated video enhancement and frame processing operations.

    VDBox/MFX contains:

    • Bitstream decoder (BSD).
    • ENC (intra prediction, motion estimation)
    • PAK (quantization, entropy coding, pixel reconstruction, motion compensation)

    VEBox/VQE contains:

    • Denoise
    • Advanced Deinterlace (ADI)
    • Local Adaptive Contrast Enhancement (LACE)
    • Camera processing features (skin tone enhancement, etc.)
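
As a rough way to keep the split above straight, the description can be written down as a lookup from media task to the hardware group that handles it. This is only a sketch of the post's description, not an Intel API: the `Engine` enum, the task names, and `engines_for` are all hypothetical labels chosen for illustration.

```python
from enum import Enum


class Engine(Enum):
    """Hardware groups named in the post (labels are illustrative)."""
    EU = "Execution Units"
    SAMPLER = "Samplers"
    VDBOX = "VDBox/MFX"
    VEBOX = "VEBox/VQE"


# Task names are hypothetical labels; the mapping follows the text above.
TASK_TO_ENGINES = {
    "rate control": (Engine.EU,),
    "mode decision": (Engine.EU,),
    "resize": (Engine.SAMPLER,),
    # Listed under ENC in VDBox; the samplers are also used for it.
    "motion estimation": (Engine.VDBOX, Engine.SAMPLER),
    "bitstream decode": (Engine.VDBOX,),
    "intra prediction": (Engine.VDBOX,),
    "quantization": (Engine.VDBOX,),
    "entropy coding": (Engine.VDBOX,),
    "denoise": (Engine.VEBOX,),
    "deinterlace": (Engine.VEBOX,),
    "contrast enhancement": (Engine.VEBOX,),
}


def engines_for(task: str) -> tuple:
    """Return the hardware groups the post associates with a task."""
    key = task.strip().lower()
    if key not in TASK_TO_ENGINES:
        raise ValueError(f"task not covered by this sketch: {task!r}")
    return TASK_TO_ENGINES[key]


if __name__ == "__main__":
    for name in ("denoise", "motion estimation", "rate control"):
        print(name, "->", [e.value for e in engines_for(name)])
```

The point of the table is that one encode session can touch several groups at once: fixed function for the heavy codec stages, EUs for the decision logic, and samplers for scaling.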

    Intel FP32 GPU performance is still very low
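
For a feel of where an FP32 number comes from, theoretical peak throughput can be estimated from the slice layout and the clock. The sketch below assumes 16 FP32 operations per EU per clock (two SIMD-4 FPUs, each doing a fused multiply-add), which is how Intel's public architecture documents describe recent generations but is not stated in this post. The slice layout and clock in the example are placeholder values, not measurements of any particular chip.

```python
# Assumption: 2 FPUs x SIMD-4 x 2 ops (multiply + add) per EU per clock.
FLOPS_PER_EU_PER_CLOCK = 16


def total_eus(slices: int, subslices_per_slice: int, eus_per_subslice: int) -> int:
    """EU count from the slice -> subslice -> EU hierarchy."""
    if min(slices, subslices_per_slice, eus_per_subslice) <= 0:
        raise ValueError("all counts must be positive")
    return slices * subslices_per_slice * eus_per_subslice


def peak_fp32_gflops(eus: int, clock_ghz: float,
                     flops_per_eu_per_clock: int = FLOPS_PER_EU_PER_CLOCK) -> float:
    """Theoretical peak in GFLOPS; real workloads reach only part of this."""
    if eus <= 0 or clock_ghz <= 0:
        raise ValueError("EU count and clock must be positive")
    return eus * flops_per_eu_per_clock * clock_ghz


if __name__ == "__main__":
    # Placeholder layout: 1 slice, 3 subslices per slice, 8 EUs per subslice.
    eus = total_eus(1, 3, 8)
    print(eus, "EUs,", peak_fp32_gflops(eus, 1.0), "GFLOPS peak at 1.0 GHz")
```

This is a ceiling rather than a benchmark, but it shows why the EU count and clock set the scale of the FP32 result.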

    image

    Info from https://software.intel.com/sites/default/files/managed/09/02/Intel_Media_Developers_Guide.pdf

  • 2 Replies
  • Addition for 7th gen CPUs:

    image

    image

    image

  • A good page about Quick Sync, old but still useful:

    image

    http://www.tomshardware.com/reviews/ivy-bridge-benchmark-core-i7-3770k,3181-7.html

    Note that the 3770K can already decode multiple 4K H.264 streams.
