BS ISO/IEC 14496-10:2014
$215.11
Information technology. Coding of audio-visual objects – Advanced Video Coding
Published By | Publication Date | Number of Pages |
BSI | 2014 | 856 |
PDF Catalog
PDF Pages | PDF Title |
---|---|
16 | Foreword |
18 | 0 Introduction 0.1 Prologue 0.2 Purpose 0.3 Applications 0.4 Publication and versions of this Specification |
20 | 0.5 Profiles and levels 0.6 Overview of the design characteristics |
21 | 0.6.1 Predictive coding 0.6.2 Coding of progressive and interlaced video 0.6.3 Picture partitioning into macroblocks and smaller partitions |
22 | 0.6.4 Spatial redundancy reduction 0.7 How to read this Specification |
23 | 1 Scope 2 Normative references 3 Definitions |
32 | 4 Abbreviations 5 Conventions 5.1 Arithmetic operators |
33 | 5.2 Logical operators 5.3 Relational operators 5.4 Bit-wise operators |
34 | 5.5 Assignment operators 5.6 Range notation 5.7 Mathematical functions |
35 | 5.8 Order of operation precedence |
36 | 5.9 Variables, syntax elements, and tables |
37 | 5.10 Text description of logical operations |
38 | 5.11 Processes 6 Source, coded, decoded and output data formats, scanning processes, and neighbouring relationships 6.1 Bitstream formats |
39 | 6.2 Source, decoded, and output picture formats |
43 | 6.3 Spatial subdivision of pictures and slices |
44 | 6.4 Inverse scanning processes and derivation processes for neighbours 6.4.1 Inverse macroblock scanning process |
45 | 6.4.2 Inverse macroblock partition and sub-macroblock partition scanning process |
46 | 6.4.2.1 Inverse macroblock partition scanning process 6.4.2.2 Inverse sub-macroblock partition scanning process |
47 | 6.4.3 Inverse 4×4 luma block scanning process 6.4.4 Inverse 4×4 Cb or Cr block scanning process for ChromaArrayType equal to 3 6.4.5 Inverse 8×8 luma block scanning process |
48 | 6.4.6 Inverse 8×8 Cb or Cr block scanning process for ChromaArrayType equal to 3 6.4.7 Inverse 4×4 chroma block scanning process 6.4.8 Derivation process of the availability for macroblock addresses 6.4.9 Derivation process for neighbouring macroblock addresses and their availability |
49 | 6.4.10 Derivation process for neighbouring macroblock addresses and their availability in MBAFF frames |
50 | 6.4.11 Derivation processes for neighbouring macroblocks, blocks, and partitions |
51 | 6.4.11.1 Derivation process for neighbouring macroblocks 6.4.11.2 Derivation process for neighbouring 8×8 luma block |
52 | 6.4.11.3 Derivation process for neighbouring 8×8 chroma blocks for ChromaArrayType equal to 3 6.4.11.4 Derivation process for neighbouring 4×4 luma blocks |
53 | 6.4.11.5 Derivation process for neighbouring 4×4 chroma blocks 6.4.11.6 Derivation process for neighbouring 4×4 chroma blocks for ChromaArrayType equal to 3 6.4.11.7 Derivation process for neighbouring partitions |
55 | 6.4.12 Derivation process for neighbouring locations 6.4.12.1 Specification for neighbouring locations in fields and non-MBAFF frames |
56 | 6.4.12.2 Specification for neighbouring locations in MBAFF frames |
57 | 6.4.13 Derivation processes for block and partition indices |
58 | 6.4.13.1 Derivation process for 4×4 luma block indices 6.4.13.2 Derivation process for 4×4 chroma block indices 6.4.13.3 Derivation process for 8×8 luma block indices 6.4.13.4 Derivation process for macroblock and sub-macroblock partition indices |
59 | 7 Syntax and semantics 7.1 Method of specifying syntax in tabular form |
60 | 7.2 Specification of syntax functions, categories, and descriptors |
62 | 7.3 Syntax in tabular form 7.3.1 NAL unit syntax |
63 | 7.3.2 Raw byte sequence payloads and RBSP trailing bits syntax 7.3.2.1 Sequence parameter set RBSP syntax |
64 | 7.3.2.1.1 Sequence parameter set data syntax |
65 | 7.3.2.1.1.1 Scaling list syntax |
66 | 7.3.2.1.2 Sequence parameter set extension RBSP syntax |
67 | 7.3.2.1.3 Subset sequence parameter set RBSP syntax |
68 | 7.3.2.2 Picture parameter set RBSP syntax |
69 | 7.3.2.3 Supplemental enhancement information RBSP syntax 7.3.2.3.1 Supplemental enhancement information message syntax |
70 | 7.3.2.4 Access unit delimiter RBSP syntax 7.3.2.5 End of sequence RBSP syntax 7.3.2.6 End of stream RBSP syntax 7.3.2.7 Filler data RBSP syntax 7.3.2.8 Slice layer without partitioning RBSP syntax |
71 | 7.3.2.9 Slice data partition RBSP syntax 7.3.2.9.1 Slice data partition A RBSP syntax 7.3.2.9.2 Slice data partition B RBSP syntax 7.3.2.9.3 Slice data partition C RBSP syntax 7.3.2.10 RBSP slice trailing bits syntax |
72 | 7.3.2.11 RBSP trailing bits syntax 7.3.2.12 Prefix NAL unit RBSP syntax 7.3.2.13 Slice layer extension RBSP syntax |
73 | 7.3.3 Slice header syntax |
75 | 7.3.3.1 Reference picture list modification syntax |
76 | 7.3.3.2 Prediction weight table syntax |
77 | 7.3.3.3 Decoded reference picture marking syntax |
78 | 7.3.4 Slice data syntax |
79 | 7.3.5 Macroblock layer syntax |
80 | 7.3.5.1 Macroblock prediction syntax |
81 | 7.3.5.2 Sub-macroblock prediction syntax |
82 | 7.3.5.3 Residual data syntax |
83 | 7.3.5.3.1 Residual luma syntax |
84 | 7.3.5.3.2 Residual block CAVLC syntax |
85 | 7.3.5.3.3 Residual block CABAC syntax |
86 | 7.4 Semantics 7.4.1 NAL unit semantics |
90 | 7.4.1.1 Encapsulation of an SODB within an RBSP (informative) 7.4.1.2 Order of NAL units and association to coded pictures, access units, and video sequences 7.4.1.2.1 Order of sequence and picture parameter set RBSPs and their activation |
92 | 7.4.1.2.2 Order of access units and association to coded video sequences 7.4.1.2.3 Order of NAL units and coded pictures and association to access units |
94 | 7.4.1.2.4 Detection of the first VCL NAL unit of a primary coded picture |
95 | 7.4.1.2.5 Order of VCL NAL units and association to coded pictures |
96 | 7.4.2 Raw byte sequence payloads and RBSP trailing bits semantics 7.4.2.1 Sequence parameter set RBSP semantics 7.4.2.1.1 Sequence parameter set data semantics |
103 | 7.4.2.1.1.1 Scaling list semantics 7.4.2.1.2 Sequence parameter set extension RBSP semantics |
105 | 7.4.2.1.3 Subset sequence parameter set RBSP semantics 7.4.2.2 Picture parameter set RBSP semantics |
107 | 7.4.2.3 Supplemental enhancement information RBSP semantics 7.4.2.3.1 Supplemental enhancement information message semantics |
108 | 7.4.2.4 Access unit delimiter RBSP semantics 7.4.2.5 End of sequence RBSP semantics 7.4.2.6 End of stream RBSP semantics 7.4.2.7 Filler data RBSP semantics 7.4.2.8 Slice layer without partitioning RBSP semantics |
109 | 7.4.2.9 Slice data partition RBSP semantics 7.4.2.9.1 Slice data partition A RBSP semantics 7.4.2.9.2 Slice data partition B RBSP semantics |
110 | 7.4.2.9.3 Slice data partition C RBSP semantics 7.4.2.10 RBSP slice trailing bits semantics 7.4.2.11 RBSP trailing bits semantics 7.4.2.12 Prefix NAL unit RBSP semantics 7.4.2.13 Slice layer extension RBSP semantics 7.4.3 Slice header semantics |
117 | 7.4.3.1 Reference picture list modification semantics 7.4.3.2 Prediction weight table semantics |
118 | 7.4.3.3 Decoded reference picture marking semantics |
122 | 7.4.4 Slice data semantics |
123 | 7.4.5 Macroblock layer semantics |
130 | 7.4.5.1 Macroblock prediction semantics |
131 | 7.4.5.2 Sub-macroblock prediction semantics |
133 | 7.4.5.3 Residual data semantics |
134 | 7.4.5.3.1 Residual luma data semantics 7.4.5.3.2 Residual block CAVLC semantics |
135 | 7.4.5.3.3 Residual block CABAC semantics 8 Decoding process |
136 | 8.1 NAL unit decoding process |
137 | 8.2 Slice decoding process 8.2.1 Decoding process for picture order count |
139 | 8.2.1.1 Decoding process for picture order count type 0 |
140 | 8.2.1.2 Decoding process for picture order count type 1 |
141 | 8.2.1.3 Decoding process for picture order count type 2 8.2.2 Decoding process for macroblock to slice group map |
143 | 8.2.2.1 Specification for interleaved slice group map type 8.2.2.2 Specification for dispersed slice group map type 8.2.2.3 Specification for foreground with left-over slice group map type 8.2.2.4 Specification for box-out slice group map types |
144 | 8.2.2.5 Specification for raster scan slice group map types 8.2.2.6 Specification for wipe slice group map types 8.2.2.7 Specification for explicit slice group map type |
145 | 8.2.2.8 Specification for conversion of map unit to slice group map to macroblock to slice group map 8.2.3 Decoding process for slice data partitions |
146 | 8.2.4 Decoding process for reference picture lists construction 8.2.4.1 Decoding process for picture numbers |
147 | 8.2.4.2 Initialisation process for reference picture lists 8.2.4.2.1 Initialisation process for the reference picture list for P and SP slices in frames |
148 | 8.2.4.2.2 Initialisation process for the reference picture list for P and SP slices in fields 8.2.4.2.3 Initialisation process for reference picture lists for B slices in frames |
149 | 8.2.4.2.4 Initialisation process for reference picture lists for B slices in fields |
150 | 8.2.4.2.5 Initialisation process for reference picture lists in fields 8.2.4.3 Modification process for reference picture lists |
151 | 8.2.4.3.1 Modification process of reference picture lists for short-term reference pictures |
152 | 8.2.4.3.2 Modification process of reference picture lists for long-term reference pictures 8.2.5 Decoded reference picture marking process |
153 | 8.2.5.1 Sequence of operations for decoded reference picture marking process 8.2.5.2 Decoding process for gaps in frame_num |
154 | 8.2.5.3 Sliding window decoded reference picture marking process |
155 | 8.2.5.4 Adaptive memory control decoded reference picture marking process 8.2.5.4.1 Marking process of a short-term reference picture as “unused for reference” 8.2.5.4.2 Marking process of a long-term reference picture as “unused for reference” 8.2.5.4.3 Assignment process of a LongTermFrameIdx to a short-term reference picture |
156 | 8.2.5.4.4 Decoding process for MaxLongTermFrameIdx 8.2.5.4.5 Marking process of all reference pictures as “unused for reference” and setting MaxLongTermFrameIdx to “no long-term frame indices” 8.2.5.4.6 Process for assigning a long-term frame index to the current picture |
157 | 8.3 Intra prediction process 8.3.1 Intra_4x4 prediction process for luma samples |
158 | 8.3.1.1 Derivation process for Intra4x4PredMode |
160 | 8.3.1.2 Intra_4x4 sample prediction |
161 | 8.3.1.2.1 Specification of Intra_4x4_Vertical prediction mode 8.3.1.2.2 Specification of Intra_4x4_Horizontal prediction mode 8.3.1.2.3 Specification of Intra_4x4_DC prediction mode |
162 | 8.3.1.2.4 Specification of Intra_4x4_Diagonal_Down_Left prediction mode 8.3.1.2.5 Specification of Intra_4x4_Diagonal_Down_Right prediction mode 8.3.1.2.6 Specification of Intra_4x4_Vertical_Right prediction mode |
163 | 8.3.1.2.7 Specification of Intra_4x4_Horizontal_Down prediction mode 8.3.1.2.8 Specification of Intra_4x4_Vertical_Left prediction mode 8.3.1.2.9 Specification of Intra_4x4_Horizontal_Up prediction mode |
164 | 8.3.2 Intra_8x8 prediction process for luma samples 8.3.2.1 Derivation process for Intra8x8PredMode |
166 | 8.3.2.2 Intra_8x8 sample prediction |
167 | 8.3.2.2.1 Reference sample filtering process for Intra_8x8 sample prediction |
168 | 8.3.2.2.2 Specification of Intra_8x8_Vertical prediction mode 8.3.2.2.3 Specification of Intra_8x8_Horizontal prediction mode |
169 | 8.3.2.2.4 Specification of Intra_8x8_DC prediction mode 8.3.2.2.5 Specification of Intra_8x8_Diagonal_Down_Left prediction mode |
170 | 8.3.2.2.6 Specification of Intra_8x8_Diagonal_Down_Right prediction mode 8.3.2.2.7 Specification of Intra_8x8_Vertical_Right prediction mode 8.3.2.2.8 Specification of Intra_8x8_Horizontal_Down prediction mode |
171 | 8.3.2.2.9 Specification of Intra_8x8_Vertical_Left prediction mode 8.3.2.2.10 Specification of Intra_8x8_Horizontal_Up prediction mode |
172 | 8.3.3 Intra_16x16 prediction process for luma samples |
173 | 8.3.3.1 Specification of Intra_16x16_Vertical prediction mode 8.3.3.2 Specification of Intra_16x16_Horizontal prediction mode 8.3.3.3 Specification of Intra_16x16_DC prediction mode 8.3.3.4 Specification of Intra_16x16_Plane prediction mode |
174 | 8.3.4 Intra prediction process for chroma samples |
175 | 8.3.4.1 Specification of Intra_Chroma_DC prediction mode |
177 | 8.3.4.2 Specification of Intra_Chroma_Horizontal prediction mode 8.3.4.3 Specification of Intra_Chroma_Vertical prediction mode 8.3.4.4 Specification of Intra_Chroma_Plane prediction mode |
178 | 8.3.4.5 Intra prediction for chroma samples with ChromaArrayType equal to 3 8.3.5 Sample construction process for I_PCM macroblocks |
179 | 8.4 Inter prediction process |
182 | 8.4.1 Derivation process for motion vector components and reference indices |
183 | 8.4.1.1 Derivation process for luma motion vectors for skipped macroblocks in P and SP slices 8.4.1.2 Derivation process for luma motion vectors for B_Skip, B_Direct_16x16, and B_Direct_8x8 |
184 | 8.4.1.2.1 Derivation process for the co-located 4×4 sub-macroblock partitions |
187 | 8.4.1.2.2 Derivation process for spatial direct luma motion vector and reference index prediction mode |
188 | 8.4.1.2.3 Derivation process for temporal direct luma motion vector and reference index prediction mode |
191 | 8.4.1.3 Derivation process for luma motion vector prediction |
192 | 8.4.1.3.1 Derivation process for median luma motion vector prediction |
193 | 8.4.1.3.2 Derivation process for motion data of neighbouring partitions |
194 | 8.4.1.4 Derivation process for chroma motion vectors 8.4.2 Decoding process for Inter prediction samples |
195 | 8.4.2.1 Reference picture selection process |
196 | 8.4.2.2 Fractional sample interpolation process |
198 | 8.4.2.2.1 Luma sample interpolation process |
200 | 8.4.2.2.2 Chroma sample interpolation process |
201 | 8.4.2.3 Weighted sample prediction process |
202 | 8.4.2.3.1 Default weighted sample prediction process |
203 | 8.4.2.3.2 Weighted sample prediction process 8.4.3 Derivation process for prediction weights |
206 | 8.5 Transform coefficient decoding process and picture construction process prior to deblocking filter process 8.5.1 Specification of transform decoding process for 4×4 luma residual blocks |
207 | 8.5.2 Specification of transform decoding process for luma samples of Intra_16x16 macroblock prediction mode |
208 | 8.5.3 Specification of transform decoding process for 8×8 luma residual blocks 8.5.4 Specification of transform decoding process for chroma samples |
210 | 8.5.5 Specification of transform decoding process for chroma samples with ChromaArrayType equal to 3 8.5.6 Inverse scanning process for 4×4 transform coefficients and scaling lists |
211 | 8.5.7 Inverse scanning process for 8×8 transform coefficients and scaling lists |
213 | 8.5.8 Derivation process for chroma quantisation parameters 8.5.9 Derivation process for scaling functions |
215 | 8.5.10 Scaling and transformation process for DC transform coefficients for Intra_16x16 macroblock type 8.5.11 Scaling and transformation process for chroma DC transform coefficients |
216 | 8.5.11.1 Transformation process for chroma DC transform coefficients 8.5.11.2 Scaling process for chroma DC transform coefficients |
217 | 8.5.12 Scaling and transformation process for residual 4×4 blocks |
218 | 8.5.12.1 Scaling process for residual 4×4 blocks |
219 | 8.5.12.2 Transformation process for residual 4×4 blocks |
220 | 8.5.13 Scaling and transformation process for residual 8×8 blocks |
221 | 8.5.13.1 Scaling process for residual 8×8 blocks 8.5.13.2 Transformation process for residual 8×8 blocks |
224 | 8.5.14 Picture construction process prior to deblocking filter process |
225 | 8.5.15 Intra residual transform-bypass decoding process |
226 | 8.6 Decoding process for P macroblocks in SP slices or SI macroblocks 8.6.1 SP decoding process for non-switching pictures 8.6.1.1 Luma transform coefficient decoding process |
228 | 8.6.1.2 Chroma transform coefficient decoding process |
229 | 8.6.2 SP and SI slice decoding process for switching pictures 8.6.2.1 Luma transform coefficient decoding process |
230 | 8.6.2.2 Chroma transform coefficient decoding process |
231 | 8.7 Deblocking filter process |
235 | 8.7.1 Filtering process for block edges |
237 | 8.7.2 Filtering process for a set of samples across a horizontal or vertical block edge |
238 | 8.7.2.1 Derivation process for the luma content dependent boundary filtering strength |
239 | 8.7.2.2 Derivation process for the thresholds for each block edge |
240 | 8.7.2.3 Filtering process for edges with bS less than 4 |
242 | 8.7.2.4 Filtering process for edges for bS equal to 4 |
243 | 9 Parsing process 9.1 Parsing process for Exp-Golomb codes |
245 | 9.1.1 Mapping process for signed Exp-Golomb codes 9.1.2 Mapping process for coded block pattern |
248 | 9.2 CAVLC parsing process for transform coefficient levels |
249 | 9.2.1 Parsing process for total number of non-zero transform coefficient levels and number of trailing ones |
253 | 9.2.2 Parsing process for level information 9.2.2.1 Parsing process for level_prefix |
254 | 9.2.3 Parsing process for run information |
257 | 9.2.4 Combining level and run information 9.3 CABAC parsing process for slice data |
259 | 9.3.1 Initialisation process 9.3.1.1 Initialisation process for context variables |
283 | 9.3.1.2 Initialisation process for the arithmetic decoding engine 9.3.2 Binarization process |
287 | 9.3.2.1 Unary (U) binarization process |
288 | 9.3.2.2 Truncated unary (TU) binarization process 9.3.2.3 Concatenated unary/ k-th order Exp-Golomb (UEGk) binarization process |
289 | 9.3.2.4 Fixed-length (FL) binarization process 9.3.2.5 Binarization process for macroblock type and sub-macroblock type |
292 | 9.3.2.6 Binarization process for coded block pattern 9.3.2.7 Binarization process for mb_qp_delta 9.3.3 Decoding process flow |
293 | 9.3.3.1 Derivation process for ctxIdx |
295 | 9.3.3.1.1 Assignment process of ctxIdxInc using neighbouring syntax elements 9.3.3.1.1.1 Derivation process of ctxIdxInc for the syntax element mb_skip_flag |
296 | 9.3.3.1.1.2 Derivation process of ctxIdxInc for the syntax element mb_field_decoding_flag 9.3.3.1.1.3 Derivation process of ctxIdxInc for the syntax element mb_type 9.3.3.1.1.4 Derivation process of ctxIdxInc for the syntax element coded_block_pattern |
297 | 9.3.3.1.1.5 Derivation process of ctxIdxInc for the syntax element mb_qp_delta 9.3.3.1.1.6 Derivation process of ctxIdxInc for the syntax elements ref_idx_l0 and ref_idx_l1 |
299 | 9.3.3.1.1.7 Derivation process of ctxIdxInc for the syntax elements mvd_l0 and mvd_l1 |
300 | 9.3.3.1.1.8 Derivation process of ctxIdxInc for the syntax element intra_chroma_pred_mode 9.3.3.1.1.9 Derivation process of ctxIdxInc for the syntax element coded_block_flag |
303 | 9.3.3.1.1.10 Derivation process of ctxIdxInc for the syntax element transform_size_8x8_flag 9.3.3.1.2 Assignment process of ctxIdxInc using prior decoded bin values |
304 | 9.3.3.1.3 Assignment process of ctxIdxInc for syntax elements significant_coeff_flag, last_significant_coeff_flag, and coeff_abs_level_minus1 |
307 | 9.3.3.2 Arithmetic decoding process |
308 | 9.3.3.2.1 Arithmetic decoding process for a binary decision |
309 | 9.3.3.2.1.1 State transition process |
311 | 9.3.3.2.2 Renormalization process in the arithmetic decoding engine |
312 | 9.3.3.2.3 Bypass decoding process for binary decisions |
313 | 9.3.3.2.4 Decoding process for binary decisions before termination |
314 | 9.3.4 Arithmetic encoding process (informative) 9.3.4.1 Initialisation process for the arithmetic encoding engine (informative) |
315 | 9.3.4.2 Encoding process for a binary decision (informative) |
316 | 9.3.4.3 Renormalization process in the arithmetic encoding engine (informative) |
318 | 9.3.4.4 Bypass encoding process for binary decisions (informative) |
319 | 9.3.4.5 Encoding process for a binary decision before termination (informative) |
320 | 9.3.4.6 Byte stuffing process (informative) |
322 | Annex A (normative) Profiles and levels |
343 | Annex B (normative) Byte stream format |
346 | Annex C (normative) Hypothetical reference decoder |
367 | Annex D (normative) Supplemental enhancement information |
428 | Annex E (normative) Video usability information |
447 | Annex F (informative) Patent Rights |
449 | Annex G (normative) Scalable video coding |
680 | Annex H (normative) Multiview video coding |
750 | Annex I (normative) Multiview and depth video coding |
796 | Annex J (normative) Multiview and depth video with enhanced non-base view coding |