@pomo-mondreganto

Per the Avro specification, when reading maps (and arrays), a negative block count means that its absolute value is the item count and that a byte size for the block follows, which allows fast skipping.

Previously, the code cast the zigzag-decoded signed value directly to usize, so a negative count such as -8 became 18,446,744,073,709,551,608 on 64-bit systems and triggered a hash table capacity overflow panic.
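For illustration, here is a minimal sketch of how a block header could be decoded under the rule above. It is not the code from this PR: `read_long` and `read_block_header` are hypothetical helper names, and the input is assumed to be an in-memory byte slice.

```rust
use std::convert::TryFrom;

/// Decode one zigzag-encoded, variable-length Avro `long` from the front of `buf`.
/// Returns the value and the number of bytes consumed, or `None` if the input
/// is truncated or longer than 10 bytes. (Hypothetical helper, not from the PR.)
fn read_long(buf: &[u8]) -> Option<(i64, usize)> {
    let mut raw: u64 = 0;
    for (i, &byte) in buf.iter().enumerate().take(10) {
        // Each byte carries 7 payload bits, least significant group first.
        raw |= u64::from(byte & 0x7f) << (7 * i);
        if byte & 0x80 == 0 {
            // Undo zigzag: (raw >> 1) XOR -(raw & 1).
            let value = ((raw >> 1) as i64) ^ -((raw & 1) as i64);
            return Some((value, i + 1));
        }
    }
    None
}

/// Read a map/array block header.
/// Returns (item count, optional block size in bytes, bytes consumed).
/// (Hypothetical helper, not from the PR.)
fn read_block_header(buf: &[u8]) -> Option<(usize, Option<usize>, usize)> {
    let (count, used) = read_long(buf)?;
    if count >= 0 {
        // Non-negative count: no byte size follows.
        return Some((usize::try_from(count).ok()?, None, used));
    }
    // Negative count: the item count is the absolute value, and a second
    // long gives the size of the block in bytes.
    let items = usize::try_from(count.unsigned_abs()).ok()?;
    let (size, used_size) = read_long(&buf[used..])?;
    let size = usize::try_from(size).ok()?;
    Some((items, Some(size), used + used_size))
}
```

With this sketch, the bytes `[0x0f, 0x28]` (zigzag encodings of -8 and 20) decode to 8 items with a 20-byte block size, whereas a plain `as` cast of -8 wraps around to the huge value quoted above.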

This change also skips parsing of 'default' field values, since the current implementation incorrectly expects them to be Schema types rather than actual default values.

Fixes reading of Apache Iceberg manifest files, which use this encoding.
