Designing an effective and high performance network requires an accurate characterization and modeling of network traffic. The modeling of video frame sizes is normally applied in simulation studies and mathematical analysis and generating streams for testing and compliance purposes. Besides, video traffic assumed as a major source of multimedia traffic in future heterogeneous network. Therefore, the statistical distribution of video data can be used as the inputs for performance modeling of networks. The finding of this paper comprises the theoretical definition of distribution which seems to be relevant to the video trace in terms of its statistical properties and finds the best distribution using both the graphical method and the hypothesis test. The data set used in this article consists of layered video traces generating from Scalable Video Codec (SVC) video compression technique of three different movies.
Universiti Sains Malaysia304/PNAV/650817/C1271. Introduction
Generally, a thorough understanding of the traffic and quality characteristics of encoded video is the basis for traffic modeling and the development of video transport mechanisms [1]. Multimedia transmissions have imposed a huge amount of the today traffic over computer and mobile communication networks. This can be done by simply using a live experiment using real networks and real sources. However, testing real networks is fairly expensive and often it is difficult to come up with realistic results. Another solution to this would be to model the traffic using mathematical analysis or simulation. Trace-driven simulations are thought reliable because they represent an actual traffic load; nevertheless they are usually static and so they provide merely a point representation of the workload space. One more disadvantage of using traces is the difficulty in adjusting parameters and extending the trace if there is a need to continue the simulation beyond the number of packets/frames in the trace file [2]. With this intention, statistical and mathematical traffic models are assumed as better solutions since they can be used to provide a better understanding of various traffic characteristics. This is because they are stochastic in nature, and hence different realizations that represent the actual data can be obtained by varying model parameters.
Among the various characteristics of video traffic, the following two are of major interest in literature:
Distribution of frame sizes.
Autocorrelation Function (ACF) that captures common dependencies between frame sizes in VBR video.
Among all multimedia applications, video services are demonstrated as the most common ones for generating traffic in communication networks. Obviously, the raw video data requires very high transmission bandwidth and large amount of storage space [3]. Therefore, using video compression techniques is highly recommended and there exist different types of network traffic based on their application. The focus of this paper is on video traces generated from a Scalable Video Codec (SVC) as a compression technique.
SVC is an extension of H.264/AVC which is standardized by The Joint Video Team of the ITU-T VCEG and the ISO/IEC MPEG H.264/AVC standard [4]. SVC has been proposed to support bandwidth efficient and loss resilient video streaming. Meanwhile, the encoding structure of SVC includes one base layer and one or more enhancement layers. H.264 SVC supports layer-scalable coding which presents Temporal Scalability, spatial scalability, and quality (SNR) Scalability [5]. SVC provides two types of quality scalability, known as coarse grain scalability (CGS) and medium grain scalability (MGS). In this paper, the statistical analysis of CGS encoding has been pondered.
In this strategy, each layer has an independent prediction procedure (all references have the same quality level) in a similar fashion to the MPEG-2. In fact, the CGS strategy can be regarded as a special case of spatial scalability when consecutive layers have the same resolution [6]. Coarse grain SNR scalable coding is achieved using the concepts for spatial scalability. The same interlayer prediction mechanisms are employed. The only difference is that base and enhancement layers have the same resolution. The CGS only allows a few selected bitrates to be supported in a scalable bitstream. In general, the number of supported rate points is identical to the number of layers. Switching between different CGS layers can only be done at defined points in the bitstream [7].
Communication network measurements have indicated that many quantities which are characterizing the network performance have long-tail probability distributions. The quantities have the tails that decay more slowly than exponential. This long-tail behavior is mostly related to the terms such as file lengths, call holding times, scene lengths in video streams, and intervals between connection requests in Internet traffic. Long-tail distributions can have a significant effect on performance [8]. long-tail service-time distributions lead to long-tail waiting-time distributions in the queues [9]. Since performance models with long-tail distributions are usually difficult to analyze, it is usually difficult to describe them in detail. To address this problem, finding the best distribution among all of the different distributions has come to account. The aim is to derive a statistical distribution to fit the real data accurately. Since they are the most common distributions related to data with long-tails distributions they were chosen. In other words these data have probability distributions with high skewness which are difficult to analysis by usual statistical models. Therefore, nonsymmetric distributions need to be addressed in this study.
The organization of this paper is as follows. The notable frame size distribution related works are presented in Section 2. Section 3 describes the methodology of the study in which different distributions are explained with the statistical properties in detail. Meanwhile, Section 4 performs the result of fitting these distributions to the data based on statistical criteria. Finally, the conclusion of this study is provided in Section 5.
2. Related Works
Several works have been conducted in order to analyze the video frame sizes. The early work performed by Heyman et al. [10, 11] and Xu and Huang [12] presented the marginal distribution of videoconference encoded by H.261 which were generated by different hardware coders with different coding algorithms is gamma distribution. Aforementioned authors applied this result to design a discrete autoregressive model (DAR) of order one. Krunz and Hughes in [13] modeled the frame sizes which are compressed by MPEG-2 standard. In this study, the best fit for the distribution of frame sizes was lognormal distribution among gamma, Weibull, and lognormal distributions. They used the fitting distribution for three types of frame sizes such as I frames, B frames, and P frames. Fitzek and Reisslein [14] have provided a public available library of frame size sequences including MPEG-4, H. 263, and H.263+ encoded video with a detail of statistical analysis of generated traces. It was shown that the movies as visual content cause a frame generation with gamma-like frame size histogram. Poon and Lo [15] presented that a normal mixture distribution for fitting the sample histogram of video traces encoded by H.261 and H.263. It was proved that this method is better than simple Gamma and Lognormal. Lazaris et al. [16] indicated that Gamma and Lognormal distribution are not always the best fit for MPEG-4 videoconference traffic. Furthermore, they presented the notion that for single videoconference sources the best fit is Pearson type V distribution among all examined distributions. Koumaras et al. [3] discovered that gamma distribution is the best fit for frame sizes compressed by H.264 standard where this fits three types of video frame. Furthermore, Masi et al. in [17] indicated that the Erlang or gamma distributions are fitted appropriately in to the three data sets of actual video frame sizes. In their work the data set consists of two different video compression standards, H.263 and H.264. The best fit of frame size distributions was used to generate packet streams for use in packet level congestion models. Salah et al. in [18] figured out that gamma is a well fitted distribution to the data and Weibull distributions and inverse Gaussian distribution is ranked second after these distributions.
For modeling single source trace, the best distribution needs to be found. Although there were few studies that analyzed packet size distribution, this article considered frame size analysis as output of SVC layers.
3. Evaluation Setup of Video Sequences
The data set presented in this paper consists of three different video traces with the CIF (352×288) resolution, a frame rate of 30 frames/second, GOP pattern: G16B15, and the quantization parameters (I, P, and B): 48, N/A, 48 taken from [19]. A video trace characterizes an encoded video stream by providing time stamp, frame type (e.g., I, P, or B), frame size (in byte), and PSNR quality for each encoded frame (and layer of a scalable encoding). Video traces can be readily fed into simulation models of video transport systems, thus, facilitating the evaluation of novel transport mechanism. The video traces under this study include the following:
NBC News sequence (48992 frames) 60 minutes long divided into one base layer and one enhancement layer in which the frame types are intraframes (I frames) and bidirectional frames (B frames).
Sony Demo sequence (17,664 frames) 60 minutes long divided into one base layer and one enhancement layer same as the previous video sequence with I and B frame types.
Silence of the Lambs sequence (53984 frames) with the similar properties of the former sequences.
Above-mentioned sequences demonstrate video sequences with low or moderate scene changes. The particular encoder tool which is used for encoding purpose is JSVM encoder taken from [19].
4. Methodology
The proper methodology for the paper is as follows:
Investigate the hypothesized distribution families which are suitable in terms of the overall shape of the data under the study.
Estimate the parameters of selected distributions by writing code.
Find the best distribution for the data by goodness of fit tests.
Investigate the autocorrelation function.
Each of these steps will be described in detail in next subsections. As the distributions studied in this paper are widely used in most literature, they will be addressed in terms of statistical theory and the relevant applications in the video traffic modeling.
4.1. Investigating the Different Distributions
Since there are numerous statistical distributions, it is not common to investigate all of them to find the best one for the data set. Therefore, plotting the density function of the data provides a preliminary point of view to identify what kinds of distributions should be studied.
Figure 1 showed that the shape of density function of NBC News with different layers and frame types is not symmetric and has high skewness. This plot had been performed for other two movies and same results are concluded. However, due to the space limitation, they were not presented for every step of implementation for rest of the article. Hence, to implement the fitting of an appropriate distribution to the data, the more common distributions with skewness include exponential, lognormal, logistic, log-logistic, Weibull, gamma, normal, inverse Gaussian, negative-binomial, and Pearson family distributions. In this article, the Pearson distribution was identified in detail, since it contains four parameters which lead to achieve more appropriate fit.
(a) The probability density function of I frame, in base layer of NBC News. (b) The probability density functions of B frame, in base layer of NBC News. (c) The probability density function of I frame, in enhancement layer of NBC News. (d) The probability density function of I frame, in enhancement layer of NBC News.
4.1.1. Pearson Distribution
Since one of the most common used distributions in literature is Pearson distribution, four-parameter system of probability density functions, which is provided by Abate et al. [9], will be explained in more detail here. On the other hand, four parameters compromise location, scale, skewness, and kurtosis of the distribution and describe the data better than the density functions with less parameters. More parameters in a distribution need more precise description of the data. Depending on some conditions which will be discussed later, there are five Pearson distribution types. In fact, skewness and kurtosis characterize these Pearson types. Let γ1 and γ2 denote skewness and kurtosis of the distribution; then some reparameterization is constructed as follows:(1)β1=γ12β2=γ2+3.A valid solution of the following differential equation specifies a Pearson density family:(2)p′xpx+a+x-λb2x-λ2+b1x-λ+b0=0;therefore:(3)p′x=-pxa+x-λb2x-λ2+b1x-λ+b0.The different types of Pearson distributions can be obtained by solving (3). It is wise to call equation (3) as a first-order linear differential equation with variable coefficients. For simplicity, λ is assumed zero. Thus, (6) can be rewrite as follows:(4)p′x+pxa+xb2x2+b1x+b0=0.To clarify how this equation will be solved, a simple linear order differential equation is described which has the general form of(5)Dyx+fxyx=gx.At first the proof of (5) is provided in Appendix A in detail and then the assumptions (6) and (7) can be released.(6)fx=a+xb2x2+b1x+b0=0,yx=pxgx=0,(7)px∝e-∫x-a/b2x2+b1x+b0dx.Now it is time to solve (7) which is the primer beginning of different types of Pearson distribution. According to the discriminate of equation in denominator, there are two general cases which leads to production of different kinds of Pearson distribution which are displayed in next subsection.
(A) Case 1 Nonnegative Discriminate. If the discrimination is positive, the equation contains two roots named a1 and a2, being maybe the same:(8)a1=-b1-b12-4b0b22b2a2=-b1+b12-4b0b22b2.A complete solution of (7) is delivered in Appendix B and the following results have been obtained at the end:(9)k=1b2a1-a2⟹px∝x-a1-ka1-ax-a2ka2-a.The density is only proportional to this mathematical phase, 4 types of the Pearson distribution can be obtained as follows.
(I) Pearson Type 1 Distribution. The Pearson type 1 distribution occurs when the denominator of (7) includes opposite sign roots, a1<0<a2. For more detailed proof please refer to the Appendix C.(10)fx=Γm1+m2+2Γm1+4Γm1+1x-λa2-a1m11-x-λa2-a1m2.
(II) The Pearson Type 2 Distribution. This distribution is a specified case of Beta distribution which is symmetric [20]. The probability density function is as follows:(11)fx=Γ2m1+1Γm1+12x-λa2-a11-x-λa2-a1m1.
(III) The Pearson Type 3 Distribution. If scale parameter is allowed to be negative, negative skewness, the Pearson type 3 distribution is obtained as follows [21]:(12)fx=1a2-a1m1+1Γm1+1x-λm1e-x-λ/a2-a1.
(IV) The Pearson Type 5 Distribution. Finally, the Pearson type IV distribution is inverse gamma distribution in which again negative scale parameter is allowed [22]. So, the probability density function is as follows:(13)fx=1a2-a1m1+1Γm1+1x-λ-m1-2e-a2-a1/x-λ.
(B) Case 2: Negative Discriminate-Pearson Type 4 Distribution. In this case the discriminant of (7) is negative (b12-4b2b0<0). The denominator of (7) contains no roots. A detailed proof of Pearson type 4 is given in Appendix D.(14)px=Γm+v/2i/Γmαβm-1/2,1/221+x-λα2-mexp-varctangx-λα.This is the Pearson type IV distribution [23].
Eventually, there are five types of Pearson distribution based on positive or negative of discriminant cases of quadratic function in the basic p(x) in this distribution. Furthermore, many well-known distributions are special case of these types of the Pearson distributions. Table 1 provides the summary of distributions.
The special cases of Pearson types.
Distribution
Form of distribution
Type
Beta distribution
fx;α,β=1Bα,βxα-11-xβ-1
1
Cauchy distribution
fx;x0,γ=1nγx-x02+γ
4
Chi-square distribution
fx;k=xk/2-1e-x/22k/2Γk/2
3
Continuous uniform distribution
f(x)=1b-aa≤x≤b
1
Gamma distribution
fx;k,θ=1θk1Γkxk-1e-x/θ
3
Inverse-chi-squared distribution
f(x;k)=2-k/2Γk/2x-k/2-1e-1/2x
5
Inverse Gamma distribution
fx;α,β=βαΓαx-α-1e-β/x
5
After describing the different distributions, the parameter estimation is the second step of methodology which will be addressed in the next subsection.
4.2. Estimating the Parameters
In statistics, there are three general estimation methods including the following:
Maximum Likelihood estimation method [24].
Least square estimation method.
Moment estimation method.
In most statistical analyses, maximum likelihood estimation (MLE) method is used to estimate parameters because it produces unbiased, consistent, and efficient estimates as it is stated by Myung [25]. Based on Kelton and Law [26] the MLE of a distribution are the parameters of that function that maximize the likelihood of the distribution given a set of observational data. Given a set of observational data x and a probability density function (PDF) f the likelihood function is(15)l=∏i=1nfxi, parameters.MLE tends to determine the values of the parameters that maximize the function l. Therefore, this paper is focused MLE method as well for parameter estimation.
4.3. Find the Best Distribution
The final step of the methodology is finding the best distribution for the data set. In statistics, there are two common approaches to fit the best distribution to the data: graphical methods and hypothesis testing which will be explained in next subsections.
4.3.1. Graphical Methods
Graphical method is a way to represent the results from the fitting process. It visually, evaluates how well a distribution matches with the input data. The most widely used and powerful goodness of fit test is quantile-quantile (Q-Q). Q-Q plot is used to calculate the quantile values of two probability distributions and plot them against each other. A point (x,y) on the plot is related to one of the quantiles of the data (x-axes) plotted against the quantiles of the considered distribution (y-axes). Therefore, if the fit is good, the points of the plot will lie approximately along the line y=x (45° reference line).
4.3.2. Hypothesis Test
Generally, in statistic, there are three different statistical hypotheses [27] known as follows:
Comparing two models which are placed in H0 and H1.
Testing parameters of a hypothesized model which are put in H0 and H1.
Comparing two different distributions which are stated in H0 and H1.
The last one is the goal of this study. To test all of these hypothesizes, the appropriate statistic, a variable with a specified distribution, needs to be established. One popular statistic to test the considered hypothesis in this study is the Kolmogorov-Smirnov (K-S) statistic which is constructed based on the cumulative distribution functions. K-S test is a nonparametric and distribution free test [28]. The K-S test applies the maximum vertical deviation among the two plots and can be identified in the manner similar to the ones described in [16].(16)D=supxFx-Gx,where Fx is an empirical distribution function of the real video traffic and Gx is the cumulative distribution function of the model.
The small values of D lead to accepting the null hypothesis in this study:(17)H0:Fx=Gxversus H1:Fx≠Gx.The main result needs to be obtained based on p value. If p value is less than 0.05, usual significant level, the null hypothesis, H0, is rejected. The distribution Fx is a suitable probability function for the data if the p value is larger than 0.05.
4.4. Autocorrelation Function
Capturing the ACF structure of VBR video traffic is actually more challenging because of the fact that VBR traces exhibit both Long-Range Dependent (LRD) and short-range dependent (SRD) properties. If the autocorrelation function decays exponentially fast it can be referred to as an SRD process, but if it decays slowly, then the source is referred to as an LRD process.
As it can be seen from the Figure 2 the autocorrelation remains high even for large numbers of lags and it decay very slowly; both these facts are a clear indication of the importance of LRD. So, strong autocorrelation coefficients are found due to the periodic repetition of I, B, and P frames, and the autocorrelation function has a very slow exponential decrease.
(a) and (b) ACF of NBC NEWS base layer and enhancement layer and (c) and (d) ACF of Sony Demo base layer and enhancement layer.
5. Result and Discussion
As mentioned, in this paper, SVC video trace includes two layers called base layer and enhancement layer; each layer consists of I frames and B frames. The video traces that are considered here belong to NBC News, Silence of the Lambs, and Sony Demo with low or moderate scene changes. To fit the different distributions and choose the best one, the data are analyzed in R software [29], statistical software, which provides the K-S statistics, its p value, and the Q-Q plots. For the first step the graphical evaluation of the distributions using Q-Q plot and K-S test was performed.
5.1. Q-Q Plot
All distributions being studied should be fitted to the data to determine the closest distribution to the actual data. To do so, the Q-Q plot was employed to detect the closer graph to the reference line, which is the best fit.
As it can be seen from Figure 3, the exponential distribution is further from the reference line compared to the other distributions in both figures. The Pearson curve is quite nearer to the reference line in comparison with other curves graphically. Although the tails are quite far in bigger size of frames, Pearson distribution can be considered as the best fit. In other words, the Pearson distribution type IV is the best distribution for NBC News in all cases which is one of the four-parameter distributions describing the data with high accuracy degree.
(a) The Q-Q plot for I frame, base layer of NBC News. (b) The Q-Q plot for B frame, base layer of NBC News.
5.2. K-S Test
In order to validate the previous results, another test, known as K-S, is performed on all of the examined data. This test is able to determine if two datasets are significantly different. One of the advantages of K-S test is that it makes no assumption about the distribution of the data [16]. The closer the graph is to the reference line, the less it was subjected to the K-S test.
Figure 4 showed that the K-S graph belonging to the Pearson distribution was the closest to the reference line. This was confirmed by the value of K-S test, which was 0.0502 for the NBC News of base layer and 0.0294 for the NBC News of the enhancement layer.
(a) The K-S test of the frame size of I frames in the NBC News base layer and (b) the K-S test of the frame size of I frames in the NBC News enhancement layer.
5.3. Statistical Evaluation
In order to provide the statistical confirmation to the visual finding from to the earlier section, a statistical test has been performed. The lower K-S value causes the better fit. This can be verified by p value in a way that the highest p value belongs to the better fit.
Table 2 indicates that the best distribution, which is fitted to the data, is the Pearson distribution type IV for the NBC News base layer and the enhancement layer (I and B frames). These findings were derived from the K-S tests and the p values. As for base layer of the I frames in the NBC News, the K-S test and p value related to the Pearson distribution type IV are 0.0502, which is the lowest, and 0.1105, which is the highest among the other distributions. In the same vein, the corresponding values for the B frames are 0.0223 and 0.2214, respectively. Although the p value exceeds 0.05 (significant level for testing), this value is better than the others, making it suitable for the best distribution in the context of this sequence of frame sizes.
The statistical evaluation for the distributions of NBC News.
For the enhancement layer of I frames in the NBC News, the lowest K-S test, which was 0.0332, and the highest p-value, which was 0.0474, belong to Pearson type IV. Similarly, for the B frames, those values are 0.0204 and 0.2824, respectively, which proves that the Pearson distribution type IV is the best. A similar form of analysis was done for both Silence of the Lambs and Sony Demo in both I and B frames of the base layer and enhancement layer and again, due to the presence of space limitation, the results are not shown. The results show that for all of the p values, which are effectively less than 0.05, none of the distributions are well fitted to these data, and, however, the lowest K-S test value or the maximum p value can be chosen for an appropriate distribution for the data. Among those distributions with the skewness mentioned above, the Pearson distribution resulted in the best fitting for all video traces in the study (for both I frames and B frames).
6. Conclusion
In this study, frame types and the corresponding layers of SVC video traces are considered as data set for which the different distributions are investigated. Eventually, the density function of frame sizes has right skewness tail. It is also true for other video traces under the study, although the limitation of space does not allow showing all of them here. As it was discussed, the Pearson distributions are appropriate to fit this data set. To achieve this finding, the analysis was started with plotting the density function of data to get some information on the data. Then by applying the Q-Q plot and K-S test, each of distribution was fitted to the data. Finally, based on the k-s statistic and the p value, the best distribution was chosen which was shown by the Q-Q plot too. Although there have been some studies in which the Pearson distributions were chosen as the best distribution, this result was for H.264 and MPEG encoded video data. Except the B frame trace of base layer, it is worth noting that the findings of this study result in the Pearson distributions as the best ones related to SVC video data for the first time and this is the first contribution of this paper. This result has been used to provide a traffic model in [30] by same author.
To the best of our knowledge, none of the existing researches explains all of the relevant distributions details in terms of concepts of parameters and theory. Furthermore, the work presented in this paper can be considered as an applicable collection of statistical distributions which are applied in the computer science and it can be regarded as the second contribution of this paper. Nevertheless, to find the best distribution for some data set under specified situations in which none of these distributions cannot be chosen regarding all of statistical criteria, applying some conversion or the mixture distribution for the data will be done as future works.
AppendixA. Proof of Equation (5)
A simple linear order differential equation is described which has the following general form:(A.1)Dyx+fxyx=gx,Equation 5.Both sides of this equation are multiplied by e∫fxdx so we have(A.2)Dyxe∫fxdx+fxyxe∫fxdx=gxe∫fxdx.Based on the product rule in differential analysis, it equals to(A.3)Dyxe∫fxdx=gxe∫fxdx;then(A.4)yxe∫fxdx=∫gxe∫fxdxdx+cyx=∫gxe∫fxdxdx+ce∫fxdx.The assumption (7) comes from the comparison between (6) and (A.5)yx=e-∫fxdx∫gxe∫fxdxdx+k.
B. The Solution of Equation (7)
This appendix covers the solution of (7) in more detail. In case of positive discrimination, there are two real roots like a1 and a2. Therefore, the quadratic function can be rewritten as follows:(B.1)fx=b2x-a1x-a2.Rewriting (7) as(B.2)px∝exp-1b2∫x-ax-a1x-a2dxthen the solution result of the integral is as follows:(B.3)∫x-ax-a1x-a2dx=∫a1-ax-a1a1-a2dx+∫a-a2x-a2a1-a2dx∫x-ax-a1x-a2dx=a1-alnx-a1a1-a2+a-a2lnx-a2a1-a2+c.Considering k=1/b2a1-a2, (7) can be rewritten as p(x)∝x-a1-ka1-ax-a2ka2-a.
C. The Proof of Pearson Distribution Type 1
Appendix C presents the proof of Pearson type 1 for better understanding to the readers. To find the density function of X, it can be written as a linear function of Y and its density function is in [31].(C.1)X=a1+ya2-a1where 0<y<1py∝a1-a2a1yva1-aa2-a1a11-yva2-apy∝ym11-ym2,where m1=a-a1/b2(a1-a2) and m2=a-a2/b2a1-a2 indicate the shape parameter(C.2)X-λa2-a1~βm1+1,m2+1,where λ=μ1-a2-a1m1+1/m1+m2+2a1 is a location parameter and a2-a1 is scale parameter.(C.3)Pearson type 1: fx=Γm1+m2+2Γm1+4Γm1+1x-λa2-a1m11-x-λa2-a1m2.
D. The Proof of Pearson Distribution Type 4
In this case, there are no roots but by defining a new variable as below, the Pearson distribution type 4 is produced and p(x) can be obtained.(D.1)y=x+b12b2,α=4b2b0-b122b2.So if(D.2)hx=b2x2+b1x+b0then hx=b2y2+α2,(D.3)py∝exp-1b2∫y-b1/2b2-ay2+α2dyon the other hand, the integral can be solved as follows:(D.4)∫y-b1/2b2-ay2+α2dy=12lny2+α2-2b2a+b12b2αarctangyα+c0;therefore(D.5)py∝1+y2α2-mexp-varctangyα,where(D.6)m=12b2,v=-2b2a+b12b2α,As it was mentioned, λ was ignored in the first equation for simplicity but here it is included to the equation(D.7)Pearson type 4 Equation 14: px=Γm+v/2i/Γmαβm-1/2,1/221+x-λα2-mexp-varctangx-λα.
Competing Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work was supported in part by Universiti Sains Malaysia through Grant no. 304/PNAV/650817/C127.
GuptaR.PulipakaA.SeelingP.KaramL. J.ReissleinM.H.264 coarse grain scalable (CGS) and medium grain scalable (MGS) encoded video: a trace based traffic and quality evaluation201258342843910.1109/tbc.2012.21917022-s2.0-84865338036TanwirS.PerrosH.A survey of VBR video traffic models20131541778180210.1109/surv.2013.010413.000712-s2.0-84888357894KoumarasH.SkianisC.GardikisG.KourtisA.Analysis of H.264 video encoded trafficProceedings of the 5th International Network Conference (INC '05)July 2005Samos, Greece4414482-s2.0-84905233340LinC.-M.ZaoJ. K.PengW.-H.HuC.-C.ChenH.-M.YangC.-K.Bandwidth efficient video streaming based upon multipath SVC multicastingProceedings of the International Wireless Communications and Mobile Computing Conference (IWCMC '08)August 2008Crete Island, Greece40641210.1109/iwcmc.2008.712-s2.0-52949131572SchwarzH.MarpeD.WiegandT.Overview of the scalable video coding extension of the H.264/AVC standard20071791103112010.1109/tcsvt.2007.9055322-s2.0-34748835762UnanueI.UrteagaI.HusemannR.Del SerJ.RoeslerV.RodriguezA.SanchezP.Del Ser LorenteJ.A tutorial on H.264/SVC scalable video coding and its tradeoff between quality, coding efficiency and performance2011chapter 1Rijeka, CroatiaInTech10.5772/19227ShahidZ.ChaumontM.PuechW.Scalable video coding2011InTech10.5772/14739FeldmannA.WhittW.Fitting mixtures of exponentials to long tail distributions to analyze network performance moldesProceedings of the 16th Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM '97)April 1997Kobe, JapanAbateJ.ChoudhuryG. L.WhittW.Waiting-time tail probabilities in queues with long-tail service-time distributions199416331133810.1007/bf011589602-s2.0-21344494520HeymanD. P.TabatabaiA.LakshmanT. V.Statistical analysis and simulation study of video teleconference traffic in ATM networks199221495910.1109/76.1343712-s2.0-0026837808HeymanD.LakshmanT. V.TabatabaiA.HeekeH.Modeling teleconference traffic from VBR video coders3Proceedings of the IEEE International Conference on Communications (ICC '94), Conference Record, ‘Serving Humanity through Communications’ (SUPERCOMM/ICC '94)May 19941744174810.1109/ICC.1994.368735XuS.HuangZ.A gamma autoregressive video model on ATM networks19988213814210.1109/76.6640982-s2.0-0032049867KrunzM.HughesH.A traffic for MPEG-coded VBR streams1995231475510.1145/223586.223592FitzekF. H. P.ReissleinM.MPEG-4 and H.263 video traces for network performance evaluation2001156405410.1109/65.9675962-s2.0-0035510001PoonW.-C.LoK.-T.A refined version of M/G/∞ processes for modelling VBR video traffic200124111105111410.1016/S0140-3664(00)00325-X2-s2.0-0035875946LazarisA.KoutsakisP.PaterakisM.A new model for video traffic originating from multiplexed MPEG-4 videoconference streams2008651517010.1016/j.peva.2007.02.0042-s2.0-35549012037MasiD. M. B.FischerM. J.GarbinD. A.Video frame size distribution analysis2008197486SalahK.Al-HaidariF.OmarM. H.ChaudhryA.Statistical analysis of H.264 video frame size distribution20115141978198610.1049/iet-com.2010.08682-s2.0-80053548499Video Trace Library, http://trace.eas.asu.edu/GarciaJ. A. D.Gutierrez-JaimezR.Matricvariate and matrix multivariate Pearson type II distributionshttps://arxiv.org/abs/1011.5083v1OlshenA. C.Transformations of the pearson type III distribution19389317620010.1214/aoms/1177732309EldertonW. P.JohnsonN. L.1969London, UKCambridge University PressMR0258166HeinrichJ.2004http://www-cdf.fnal.gov/physics/statistics/notes/cdf6820_pearson4.pdfNagaharaY.The PDF and CF of Pearson type IV distributions and the ML estimation of the parameters199943325126410.1016/S0167-7152(98)00265-XZBL0930.620132-s2.0-0039350903MyungI. J.Tutorial on maximum likelihood estimation20034719010010.1016/S0022-2496(02)00028-7MR1982757ZBL1023.621122-s2.0-0038172395KeltonW. D.LawA. M.2000Boston, Mass, USAMcGraw-HillMR630193CasellaG.BergerR. L.Statistical inference20019AsaiM.DashzevegU.A distribution-free test for symmetry with an application to S&P index returns200815646146410.1080/135048506007064382-s2.0-42549092363StockwellD.2007Boca Raton, Fla, USAChapman & Hall/CRC Taylor & FrancisChapman & Hall/CRC Mathematical and Computational Biology SeriesMR2374574AhmadpourS.ToghrayeeZ.WanT.-C.An extended discrete autoregressive model for variable bit rate video traffic encoded by scalable video codec201528182239225410.1002/dac.30112-s2.0-84958603197PearsonK.Mathematical contributions to the theory of evolution.—X. Supplement to a memoir on skew variation190168442–45037237310.1098/rspl.1901.0060