MPEG-4 AVC/H.264 Video Codecs Comparison 2010 - Appendixes

MSU Graphics & Media Lab (Video Group)

Project head: Dr. Dmitriy Vatolin

Measurements, analysis: Dr. Dmitriy Kulikov,
Alexander Parshin


Return to H.264 Video Codec Comparison Report


Appendix 1. Subjective Comparison and Psycho-visual Enhancements


Brief Description

In this work, we analyzed video codecs not only with help of objective metrics, but also using subjective comparisons using SAMVIQ (Subjective Assessment Method for Video Quality evaluation) methodology. This new method was created by the EBU (European Broadcasting Union).

Another task in this part of comparison was to analyze the psycho-visual enhancement quality during encoding. We tested five video codecs, and one codec was tested with two almost identical presets: one without psycho-visual enhancement and another with psycho-visual enhancement.

  • Methodology: SAMVIQ
  • Number of experts: 42
  • Number of sequences: 5
  • Number of codecs (presets): 6
  • SAMVIQ Description

    During testing, each expert is able to play any sequence from the test set and give it a mark, and he is able to play a reference video. Marks are in the range of 0 to 100. More information about the methodology can be found in the publication SAMVIQ: A new EBU methodology for video quality evaluations in multimedia(Kozamernik, F., Steinmann, V., Sunna, P. and Wyckens, E., SMPTE journal, 2005 04 April).

    Subjective Assessment Description

    Forty-two experts participated in this subjective assessment. The experts represent a variety of individuals: males and females of ages 18 to 40 who are PC users or video and graphics experts. Each user watched five video groups, with eight videos in each group. Each video group contains six encoded videos and one reference video (the viewers were unaware of which type they were viewing), as well as a reference video specified as such. Users gave each video a mark of 0 to 100, and the marks for all users and sequences were averaged. Some extreme results were discarded before obtaining the final result.

    Video Sequences

    Five different video sequences were used in this comparison.

    "Battle"

  • Sequence title: Battle
  • Resolution: 1280x544
  • Number of frames: 586
  • Color space: RGB24
  • Frames per second: 25
  • This sequence is a fragment from the beginning of the movie Terminator 2. The compression of this sequence is the most difficult among all of the sequences in the analysis. This difficulty is mainly due to three reasons: continual brightness variation (resulting from explosions and laser flashes as seen in the picture above), very fast motion and frequent scene changes. These characteristics often cause codecs to compress frames as I-frames.

    "Foorball"

  • Sequence title: Football
  • Resolution: 1280x720
  • Number of frames: 564
  • Color space: RGB24
  • Frames per second: 25
  • This sequence is a part of sports telecast. The sequence has a high contrast level, strong motion and rich colors.

    "Italy"

  • Sequence title: Italy
  • Resolution: 1280x720
  • Number of frames: 493
  • Color space: RGB24
  • Frames per second: 25
  • This sequence has many scenes with fading transitions between them. This fading can cause encoding difficulties. The sequence contains many sharp details.

    "Quadbike"

  • Sequence title: Italy
  • Resolution: 1280x720
  • Number of frames: 562
  • Color space: RGB24
  • Frames per second: 25
  • This sequence has slow motion with an almost static camera and few scene changes.

    "Simpsons"

  • Sequence title: Italy
  • Resolution: 1280x528
  • Number of frames: 514
  • Color space: RGB24
  • Frames per second: 25
  • This sequence is a part of the Simpsons animated movie; it has high contrast and different types of motion.

    Video Codecs

    Five codecs were used in the comparison:
  • DivX MPEG-4 ASP
  • DivX H.264
  • Elecard
  • MainConcept
  • x264 (two presets: with psycho-visual enhancement and without)
  • Results

    Full results could be found in professional version of this report:
    H.264 Comparison Report Pro 2010 - Enterprise edition
    This version contains only one-sequence results

    "Football" Sequence

    For the "Football" sequence, x264 provides the highest quality. x264 with psycho-visual enhancement provides very similar (almost undistinguishable) results. The codecs can be rated by visual quality as follows:
    1. x264 with psycho-visual enhancement
    2. x264
    3. MainConcept
    4. DivX H.264
    5. Elecard
    6. DivX ASP
    Mean opinion score (MOS), 'Football' sequence

    All subjective comparison results could be found in
    H.264 Comparison Report Pro 2010 - Enterprise edition

    Appendix 2. x264 Over-years Comparison


    The quality of an H.264 codec, over several years, can be compared for a given video sequence. The x264 encoder was chosen for this task because it is present in almost every MSU VIDEO MPEG-4 AVC/H.264 codec comparison, and it produces good results compared with other encoders. Next figures show the position of the x264 codec compared with other codecs for the “Battle” sequence. For all years except 2005, x264 shows the best results. For years 2006–2009, we have shown results using Y-SSIM as the quality metric; for 2005, we did not use this as the main metric. In light of these results, x264 could be a good reference encoder for analyzing the overall progress of H.264 encoders over time.
    Next figure shows the RD curve for the “Battle” sequence using x264 encoders from different years. The best encoder is this year’s x264; the worst is the 2005 version.

    Using SSIM, the codecs can be ranked as follows:

    1. x264 (2010)
    2. x264 (2009)
    3. x264 (2007)
    4. x264 (2006)
    5. x264 (2005)
    These results are shown in next figure. This figure indicates that the overall progress is very good, and that the x264 encoder has increased in speed and quality over recent years. But the old x264 does not use multithreading, so encoding speed varies considerably.

    Interestingly, if Y-PSNR is used as the quality metric, x264 does not exhibit the kind of progress shown in previous figure. This difference is because for past years, the x264 developers specified, for use in testing, the presets optimized for the SSIM metric. Also, note that no encoder (among the different x264 versions) produces the best results simultaneously for both SSIM and PSNR.

    The bitrate handling mechanism for the x264 encoder is quite good for each version, as following figure indicates. Results for previous x264 versions (0.98 of target bitrate) could be explained by a different interpretation of kbps (1,024 versus 1,000 bits per second).

    Bitrate handling for different x264 encoder versions

    The per-frame analysis presented in following figure shows that the main encoding mechanism did not changed significantly.

    Per-frame analysis for different x264 versions—'Battle' sequence, 700kbps

    Appendix 3. Theora vs. x264 Single-thread Comparison


    Multithreaded encoding is not supported by the Theora encoder, but CPUs with four cores are used for the tests. This situation leads to an incorrect speed comparison between Theora and other codecs. To eliminate this disparity, we have turned on the x264 encoder’s single-thread mode and compared the results with those of Theora. The overall speed/quality trade-off for “Movies” and “HDTV” use cases is depicted in next figures.

    The main conclusion is that even in single-thread mode, Theora is a little bit slower and produces much poorer quality than does x264.


    Appendix 4. Test Set of Video Sequences


    Movie Sequences

    "Ice Age"

  • Sequence title: Ice Age
  • Resolution: 720x480
  • Number of frames: 2014
  • Color space: YV12
  • Frames per second: 24
  • Source: MPEG-2 (DVD9), 5.7Mbps
  • Ice Age sequence, frame 500

    This sequence is a fragment from the Ice Age 3 animated movie. This movie has low-contrast portions and high-contrast portions, and it has many types of motion: camera panning, slow motion and very fast motion. Also, it has a scene with colors that differ completely from those of other scenes. Small black letterboxes appear at the top and bottom of the video.

    "Indiana Jones"

  • Sequence title: Indiana Jones
  • Resolution: 704x288
  • Number of frames: 5000
  • Color space: YV12
  • Frames per second: 30
  • Source: MPEG-2 (DVD)
  • Indiana Jones sequence, frame 1

    This sequence is a fragment from the Indiana Jones movie. Compression of this sequence is difficult for two main reasons: the presence of low-contrast scenes and the high level of motion in different scenes. Also, several scenes have very different types of motion, ranging from almost static scenes with talking people to scenes with strong motion (for example, the scene where stones fall).

    "State Enemy"

  • Sequence title: State Enemy
  • Resolution: 720x304
  • Number of frames: 6500
  • Color space: YV12
  • Frames per second: 24
  • Source: MPEG-2 (DVD)
  • State Enemy sequence, frame 1115

    This sequence is a fragment from the Enemy of the State movie. This sequence includes outdoor scenes with strong motion at the beginning when the bicyclist runs, as well as scenes with low motion and indoor scenes with normal motion. This sequence has scenes with different lighting conditions.

    "Up"

  • Sequence title: Up
  • Resolution: 720x480
  • Number of frames: 1920
  • Color space: YV12
  • Frames per second: 24
  • Source: MPEG-2 (DVD9), 6.5Mbps
  • Up sequence, frame 638

    This sequence is a fragment from the Up animated movie. The sequence contains low-contrast scenes with almost static brightness and high-colored scenes, and it contains many scenes with a few frames that include quick scene changes.

    HDTV Sequences

    "Amazon"

  • Sequence title: Amazon
  • Resolution: 1280x720
  • Number of frames: 1200
  • Color space: YV12
  • Frames per second: 24
  • Source: Windows Media (6.4Mbps)
  • Amazon sequence, frame 200

    This sequence contains scenes with the camera panning over a landscape view of the Amazon; also, some scene changes take place with the camera panning again. The video frames consist of many edges and high-contrast, sharp details, such as leaves and branches. This sequence was downloaded from the Microsoft website: http://www.microsoft.com/windows/windowsmedia/musicandvideo/hdvideo/contentshowcase.aspx

    "Iron Man"

  • Sequence title: Iron Man
  • Resolution: 1920x1080
  • Number of frames: 600
  • Color space: YV12
  • Frames per second: 24
  • Source: H.264, 14Mbps
  • Iron Man sequence, frame 455

    This sequence is a part of the Iron Man 2 movie trailer. It has low-brightness scenes at the beginning, followed by a scene with very fast motion and scenes with slow camera panning. Some frames are very blurry. Also, the sequence contains scenes with flashing lights, which could be a big problem for some encoders. Black letterboxes appear at the top and bottom of the video.

    "Mobile Calendar"

  • Sequence title: Mobile Calendar
  • Resolution: 1280x720
  • Number of frames: 504
  • Color space: YV12
  • Frames per second: 50
  • Source: Uncompressed, progressive
  • Mobile Calendar sequence, frame 416

    This close-up sequence is similar to “Mobile&Calendar” and includes a moving calendar with text and a detailed photo of the Vasa ship. It also includes a moving train with colorful toys. The background has two types of wallpaper: one is brown with details, and the other is yellow with drawn figures. The sequence is very detailed and is normally demanding. The main potential compression difficulty is the many small, sharp details on the calendar and on the background.

    "Troy"

  • Sequence title: Troy
  • Resolution: 1920x1072
  • Number of frames: 300
  • Color space: YV12
  • Frames per second: 24
  • Source: MPEG-2
  • Troy sequence, frame 1

    This sequence is a fragment of the “Troy” movie and contains three parts with sharp scene changes. The video includes medium scene motion and slow camera motion. In terms of compression, this sequence is difficult to compress because of the many small details.


    Appendix 5. Tested Codecs and Presets


    Codecs

    DivX AVC/H.264 Video Encoder

  • Console encoding program version 1.1.1.6
  • Presets were chosen by ourselves to meet the comparison requirements
  • Remarks: Owing to our choice of presets, the results for the DivX H.264 encoder could be slightly diminished compared with the case where the developers provide the presets.

    Elecard AVC Video Encoder 8-bit edition

  • Console encoding program version 2.1.022202.091207
  • Codec and presets were provided by Elecard Ltd Company specifically for this test
  • Intel® MediaSDK AVC/H.264

  • Console encoding program, version 1.10.1.15
  • Codec and presets were provided by Intel Corp. specifically for this test
  • MainConcept AVC/H.264 Video Encoder Console Application

  • Console encoding version 8.5.0
  • Codec and presets were provided by MainConcept AG Company specifically for this test
  • Microsoft Expression Encoder 3

  • GUI encoding program
  • Presets chosen ourselves for the analysis
  • Remarks: Owing to a long initial GUI loading time, the encoding time for Microsoft Expression Encoder is significantly higher than for other encoders. Unfortunately, we were unable to encode multiple files without the GUI loading each time. This is likely one of reasons for the encoder’s poor time results.

    Theora encoder

  • Console encoding program
  • Both encoder and decoder were provided by developers
  • Remarks: Theora’s low encoding speed is explained by the version’s lack of multithreading support. Thus, Theora was tested in single-thread mode.

    x264

  • Console encoding program version core:85 r1442M 781d300
  • Codec and presets were provided by developers specifically for this test
  • Remarks: The presets provided by the developers for this comparison were specifically chosen for the SSIM metric

    Presets

    The table below lists the settings used in this comparison for all of the codecs.
    Presets Presets Presets Presets Presets Presets Presets

    Return to H.264 Video Codec Comparison Report


    Contact Information


    See all MSU Video Codec Comparisons

    MSU video codecs comparisons resources:


    Other Materials


    Video resources:

    Last updated: 28-May-2025


    Server size: 8069 files, 1215Mb (Server statistics)

    Project updated by
    Server Team and MSU Video Group

    Project sponsored by YUVsoft Corp.

    Project supported by MSU Graphics & Media Lab