1. 9856d5e Merge pull request #29 from apache/nullable_theta by Alexander Saydakov · 5 months ago main
  2. 0ed6a12 attempt to fix codegen by AlexanderSaydakov · 5 months ago
  3. 69361b0 support nullable by AlexanderSaydakov · 5 months ago
  4. 3d9e087 Merge pull request #28 from apache/as_binary by Jon Malkin · 5 months ago
  5. 54155d1 Finish renaming by Jon · 5 months ago
  6. 3a4e900 rename cast_to_binary to cast_as_binary by Jon · 5 months ago
  7. 7b60f9d Merge pull request #27 from apache/to_binary by Jon Malkin · 5 months ago
  8. a0b6589 fix prettyName and remove commented-out example by Jon · 5 months ago to_binary
  9. be83fd5 add function to cast sketches to BinaryType to handle spark weirdness by Jon · 6 months ago
  10. 8437894 Merge pull request #26 from apache/test_improvements by Jon Malkin · 6 months ago
  11. 5a015cd use correct name for codegen_only by Jon · 6 months ago
  12. 6baf924 Should improve codegen on/off control, force use of both in workflow by Jon · 6 months ago
  13. 04cd693 Merge pull request #25 from apache/test_codegen by Jon Malkin · 6 months ago
  14. 20bfb8d Add explicit test forcing the use of codegen for UDFs which provide it by Jon · 6 months ago
  15. d127655 Merge pull request #24 from apache/tidy-example by Jon Malkin · 6 months ago
  16. 9ce390b use bucket name variable by Jon Malkin · 6 months ago
  17. acf0fe9 tidy example by Jon Malkin · 6 months ago
  18. 647c1f6 Merge pull request #23 from apache/codegen_fix by Jon Malkin · 6 months ago
  19. f3e5b47 Clean up codegen to avoid exceptions and allow that processing path by Jon · 6 months ago
  20. f7da381 Merge pull request #21 from apache/theta_params by Jon Malkin · 6 months ago
  21. 9c6e455 fixed return type by AlexanderSaydakov · 6 months ago
  22. e8f8cf6 Merge pull request #22 from apache/airflow_example by Jon Malkin · 6 months ago
  23. 1c0d692 Add a partial example of using the library with airflow+dataproc by Jon Malkin · 6 months ago
  24. ffe0066 theta functions with parameters by AlexanderSaydakov · 6 months ago
  25. e3f9503 added params to theta build agg by AlexanderSaydakov · 7 months ago
  26. eac95b8 Merge pull request #20 from apache/pyspark_java17 by Jon Malkin · 7 months ago
  27. 9e9cb79 Run pyspark CI for both java 11 and 17 by Jon Malkin · 7 months ago
  28. 4b55fba Update readmes by Jon Malkin · 7 months ago
  29. 2d0096d Fix get_pmf/cdf codegen not using proper naming, and add support for java17 in pytest (we hope) by Jon Malkin · 7 months ago
  30. a4387a3 Merge pull request #19 from apache/python by Jon Malkin · 7 months ago
  31. 0fd0d61 Minor fix: use | for multi-line commands by Jon Malkin · 7 months ago
  32. 6e347e7 Minor fix: better naming by Jon Malkin · 7 months ago
  33. 02ed084 Finished moving edits -- sbt test in scala, sbt package in python by Jon Malkin · 7 months ago
  34. dbe59e1 Rename Python CI job to disambiguation, put edits in correct file by Jon Malkin · 7 months ago
  35. 00ad689 Try to fix main ci workflow warnings, add python ci workflow by Jon Malkin · 7 months ago
  36. 55fc7d5 Switch from isBlank() to isEmpty() so java8 works by Jon Malkin · 7 months ago
  37. 6357cdd Update readmes with build instructions by Jon Malkin · 7 months ago
  38. 50794a0 improve kll merge test by Jon Malkin · 7 months ago
  39. b1ebc1f Tidy up kll python tests a bit by Jon Malkin · 7 months ago
  40. 70742f2 Remove deprecation warning for reading package resource files by Jon · 7 months ago
  41. 8945522 Ensure python deps path exists before copying into it by Jon · 7 months ago
  42. 71bc84e final(?) changes for baseline kll in pyspark, including useful tests by Jon Malkin · 7 months ago
  43. a9144fe finish setting up config to support pyspark package, including methods to simplify use by Jon Malkin · 7 months ago
  44. 0858aa7 minor cleanup, add TODO for tox by Jon Malkin · 7 months ago
  45. d63c5aa cleaner build.sbt that better interacts with setting up python. many changes to make python build work for wheel or sdist by Jon Malkin · 7 months ago
  46. 4e8d151 WIP: modified build.sbt to copy ds-java jars, added setup.py to include in sdist/wheels. by Jon Malkin · 7 months ago
  47. a432672 Clean up codegen by setting ev.isNull properly by Jon Malkin · 7 months ago
  48. b6940ee fix misplaced paren in _array_as_java_column by Jon Malkin · 7 months ago
  49. a1dd278 WIP: Initial files for pyspark testing. Not fully working but checkpointing here by Jon Malkin · 7 months ago
  50. 255b048 Merge pull request #18 from apache/rename_packages by Jon Malkin · 7 months ago
  51. 050710f Rename theta packages, too by Jon Malkin · 7 months ago
  52. c44364a Add datasketches directory to hierarchy by Jon Malkin · 7 months ago
  53. 7fd4817 Rename packages to use org.apache.spark.sql.datasketches and update imports as needed by Jon Malkin · 7 months ago
  54. 6779cde Merge pull request #17 from apache/spark3.3_compat by Jon Malkin · 7 months ago
  55. 7b311b2 Compatibility with spark 3.3 by Jon Malkin · 7 months ago
  56. eb7b4b2 Merge pull request #16 from apache/scala_compat by Jon Malkin · 8 months ago
  57. 28de4b1 scalaVersion seems to be a predefined key. Try passing in to sbt script but allow default by Jon Malkin · 8 months ago
  58. 7f6f1a6 properly declare default variables, better name for workflow by Jon Malkin · 8 months ago
  59. 9b6ab61 Rename KllExpressions file, make compatible with scala 2.12 and 2.13 by Jon Malkin · 8 months ago
  60. d8da04a Merge pull request #15 from apache/kll_rename by Jon Malkin · 8 months ago
  61. 7f88746 Fix typo in name by Jon Malkin · 8 months ago
  62. ffe0be1 rename KLL aggregate files by Jon Malkin · 8 months ago
  63. afc1d8d Merge pull request #14 from jmalkin/ci_workflow by Jon Malkin · 8 months ago
  64. 9663a21 add jdk and spark versions to cache by Jon Malkin · 8 months ago
  65. c22065f use env variable for spark version, default to 3.5.4 by Jon Malkin · 8 months ago
  66. 1a86eb7 remove unnecessary cat of workflow by Jon Malkin · 8 months ago
  67. 323e937 attempt #1 at ci workflow by Jon Malkin · 8 months ago
  68. 5e194b3 Merge pull request #13 from apache/renaming by Jon Malkin · 8 months ago
  69. e1650fe Rename underlying scala classes and public function names for more consistent style by Jon Malkin · 8 months ago
  70. 395c55e Merge pull request #12 from apache/restructure by Jon Malkin · 8 months ago
  71. c2d8ac0 align aggregation function naming with BQ style by Jon Malkin · 8 months ago
  72. 6384c36 rename scala/SQL methods to use current standard, rename internal classes to follow java standard. move to jdk17 by Jon Malkin · 8 months ago
  73. 574ed07 Restructure repo as a proposal for future manageability by Jon Malkin · 8 months ago
  74. 50220f6 Merge pull request #11 from apache/fixes_after_conflict by Jon Malkin · 8 months ago
  75. 84e010f fixed problems introduced while resolving a conflict by AlexanderSaydakov · 8 months ago
  76. 2cc419b Merge pull request #7 from apache/theta by Alexander Saydakov · 8 months ago
  77. 923c640 Merge branch 'main' into theta by Alexander Saydakov · 8 months ago
  78. 867c6c8 Merge pull request #10 from apache/renaming by Jon Malkin · 8 months ago
  79. 8d07a4c Merge pull request #9 from apache/kll_merge by Jon Malkin · 8 months ago
  80. 8b8d855 Rename function inputs to something useful, define e.g. left and right elsewhere to satisfy interface API by Jon Malkin · 8 months ago
  81. 5102ed5 Update KllMerge.scala by Jon Malkin · 8 months ago
  82. b4c2478 Define more merge constructors to fix SQL multi-argument issue by Jon Malkin · 8 months ago
  83. ea0c699 Update kll merge to accept k or fall back to a default value. SQL currently complains if specifying k by Jon Malkin · 8 months ago
  84. de14d0e Merge pull request #8 from apache/simplify_registration by Jon Malkin · 8 months ago
  85. 1944a2b Merge pull request #6 from apache/cleanup by Jon Malkin · 8 months ago
  86. c689e44 Clean up codegen errors a bit -- still one for pmf/cdf but fallback works ok by Jon Malkin · 8 months ago
  87. 455dcde Two unrelated changes. Add type-checking to k in sketch creation, and modify pmf/cdf to allow simpler registration for SQL by Jon Malkin · 8 months ago
  88. b8c806f Merge branch 'main' of github.com:apache/datasketches-spark by Jon Malkin · 8 months ago
  89. 5addc8d fixed union by AlexanderSaydakov · 8 months ago
  90. ed9ef30 theta prototype by AlexanderSaydakov · 8 months ago
  91. f6c79bc Remove now-deleted wrapper from codegen for expressions by Jon · 8 months ago
  92. 3923c57 Move function registration from trait to accompanying object by Jon · 8 months ago
  93. 1669753 Fix typo by Jon · 8 months ago
  94. 70e8349 Merge pull request #3 from apache/remove_wrapper by Jon Malkin · 8 months ago
  95. 636ee8c Merge pull request #2 from apache/import_kll by Jon Malkin · 8 months ago
  96. 6d22649 Clean up references to wrapper and remove that class by Jon · 8 months ago
  97. 1a9bc31 Remove need for extra wrapper around sketch, although still a bunch to clean up by Jon · 8 months ago
  98. 7e001c1 Add udf to import sketch images from outside spark into a dataframe by Jon · 9 months ago
  99. 712f213 Merge pull request #1 from apache/docs by Jon Malkin · 9 months ago
  100. 208c13d fix wording on a few variables by Jon Malkin · 9 months ago