Hello Eyal,
it usually happens due to packet disorder as Ray said.
This disorder can be cause by network conditions and in the case of the video because of retransmissions.
Take a look at [1], but taking into account that in the case of WebRTC there must not be duplicates because they are dropped when SRTP unprotecting.
Refs
[1]
https://tools.ietf.org/html/rfc3550#section-6.4.1 cumulative number of packets lost: 24 bits
The total number of RTP data packets from source SSRC_n that have
been lost since the beginning of reception. This number is
defined to be the number of packets expected less the number of
packets actually received, where the number of packets received
includes any which are late or duplicates. Thus, packets that
arrive late are not counted as lost, and the loss may be negative
if there are duplicates. The number of packets expected is
defined to be the extended last sequence number received, as
defined next, less the initial sequence number received.