AVRO-2648: Incorrect validation of numeric default values

Validation of numeric default values in Java is incorrect and results
in API inconsistencies. Consider the following examples:

Double values as int field default values:
public void testDoubleAsIntDefaultValue() {
  Schema.Field field = new Schema.Field("myField",
          Schema.create(Schema.Type.INT), "doc", 1.1);
  field.hasDefaultValue(); // true
  field.defaultValue(); // internal DoubleNode (1.1)
  field.defaultVal(); // null
  GenericData.get().getDefaultValue(field); // Integer (1)

  field = new Schema.Field("myField",
          Schema.create(Schema.Type.INT), "doc", 1.0);
  field.hasDefaultValue(); // true
  field.defaultValue(); // internal DoubleNode (1.0)
  field.defaultVal(); // null
  GenericData.get().getDefaultValue(field); // Integer (1)
}

Invalid long value as int field default value:
public void testInvalidLongAsIntDefault() {
  Schema.Field field = new Schema.Field("myField",
         Schema.create(Schema.Type.INT), "doc", Integer.MAX_VALUE + 1L);
  field.hasDefaultValue(); // true
  field.defaultValue(); // internal LongNode (2147483648)
  field.defaultVal(); // Long (2147483648)
  GenericData.get().getDefaultValue(field); // Integer (-2147483648)
}

This PR makes changes to invalidate incorrect default values for INT and
LONG schemas, including all floating point values, e.g. 1.0.
Additionally it contains changes to try and return the appropriate
Object type given the schema type.

This change is necessary for correctness and consitency but also
because users cannot disable default value validation and handle
these cases on their own since the underlying Field.defaultValue()
is no longer public. Users only have access to default values
mutated by Field.defaultVal() and GenericData.getDefaultValue().

Notes on JacksonUtils.toObject():
 - This method is used to convert the underlying JsonNode default value
  to an Object when Field.defaultVal() is called. This method is
  invoked regardless of whether default value validation is true or
  false.
 - For LongNode values we continue to return Long values for INT
   schemas in the case we cannot safely convert to an Integer.
   This behavior, while maintained, is inconsistent with that
   of FloatNode / DoubleNode where null is returned for INT
   and LONG schemas. Additional changes may be needed for
   further consistency.

Closes #739
(cherry picked from commit 8b5c11ade7f73eb74a0f017bd52b9b485d68d42f)
3 files changed
tree: b12a2da47c2136ea324863bbf296d050341eb97c
  1. .github/
  2. .travis/
  3. doc/
  4. lang/
  5. share/
  6. .asf.yaml
  7. .editorconfig
  8. .gitignore
  9. .travis.yml
  10. .yamllint.yml
  11. BUILD.md
  12. build.sh
  13. composer.json
  14. DIST_README.txt
  15. LICENSE.txt
  16. NOTICE.txt
  17. pom.xml
  18. README.md
README.md

Build Status

Apache Avro™

Apache Avro™ is a data serialization system.

Learn more about Avro, please visit our website at:

https://avro.apache.org/

To contribute to Avro, please read:

https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute