I don't think this is cronet-defined behaviour; the networking stack just gets the bytes to the rendering engine, which is responsible for interpreting them. The interpretation depends to a large extent on the content type of the resource (or the purpose of the resource, if we have to guess the content type, too). HTML resources should largely follow the spec here:
https://html.spec.whatwg.org/#determining-the-character-encoding . Note, though, that this algorithm starts off deterministically, but relies heavily on heuristics and implementation-defined behaviour by the end.
XML responses, JSON responses, and other text formats all likely use a different pathway.