  1. 360b3d3 merge conflict and flaky test by Tim Allison · 2 days ago (main)
  2. 8870bf9 TIKA-4755 - extra jars (#2880) by Tim Allison · 3 days ago
  3. 2ce13d8 make mojibuster default (#2882) by Tim Allison · 3 days ago
  4. 55655ed TIKA-4727 -- catch ioobe (#2885) by Tim Allison · 3 days ago
  5. 9eff319 TIKA-4745 - small twiddle on charset detection (#2886) by Tim Allison · 3 days ago
  6. 333d281 small updates to tika-eval (#2881) by Tim Allison · 3 days ago
  7. 8a7728a Bump jackson.version from 2.21.4 to 2.22.0 (#2883) by dependabot[bot] · 4 days ago
  8. b8cf60d Bump com.fasterxml.woodstox:woodstox-core from 7.2.0 to 7.2.1 (#2884) by dependabot[bot] · 4 days ago
  9. 8d9900e TIKA-4745 -- efficiency improvements (#2878) by Tim Allison · 6 days ago
  10. 1849dc7 TIKA-4750 - improve docs (#2879) by Tim Allison · 6 days ago
  11. c620f31 TIKA-4327: update sqlite, zstd, jacoco by Tilman Hausherr · 6 days ago
  12. 110fc5f TIKA-4327: update google-auth-library-oauth2-http, aws2, google cloud, biz.aqute, jna, kotlin, micronaut by Tilman Hausherr · 6 days ago
  13. a23433c TIKA-4663 - make markdown the default content handler in tika-app, tika-server, and the async CLI (#2877) by Tim Allison · 7 days ago
  14. de9433e TIKA-4754 -- move to bloom filters for common_tokens (#2876) by Tim Allison · 7 days ago
  15. 67ada35 improve isatab parsing (#2875) by Tim Allison · 7 days ago
  16. 327eda1 TIKA-4753 - improve oom/timeout/crash msg (#2870) by Tim Allison · 7 days ago
  17. d1e81e3 TIKA-4745-follow-on-junk-improvements (#2872) by Tim Allison · 7 days ago
  18. d66da4f TIKA-4752-follow-up (#2871) by Tim Allison · 7 days ago
  19. 945378a improve hssf parsing (#2874) by Tim Allison · 7 days ago
  20. 88a6fc5 TIKA-4752 -- improve zip name detection (#2869) by Tim Allison · 7 days ago
  21. ffd7129 TIKA-4750 - improve error msg when component not on classpath (#2868) by Tim Allison · 7 days ago
  22. 127665a TIKA-4751 - decode as (#2867) by Tim Allison · 7 days ago
  23. cf7d87a TIKA-4747 -- add axml detection (#2865) by Tim Allison · 8 days ago
  24. 4ce5c70 TIKA-4745 - charset/junk/tika-eval improvements (#2861) by Tim Allison · 8 days ago
  25. ce700b6 TIKA-4221 - tmp workaround for pack200 (#2863) by Tim Allison · 8 days ago
  26. 48257e3 TIKA-4748 -- clean up ocr configuration within pdfparser (#2864) by Tim Allison · 8 days ago
  27. ecbccdd TIKA-4747 -- improve pdf and ocr/imagemagick docs. Make sure to include default-parser (#2862) by Tim Allison · 9 days ago
  28. 363378f TIKA-4749 - improve inline handling of metadata only (#2866) by Tim Allison · 9 days ago
  29. 681f9e8 TIKA-4327: update jaxb, micronaut, netty, jackrabbit by Tilman Hausherr · 10 days ago
  30. 1db9aa1 TIKA-4746 -- sweep docs (#2852) by Tim Allison · 11 days ago
  31. 45eb97c TIKA-4743 -- fix search result links (#2860) by Tim Allison · 11 days ago
  32. 66acc31 Bump org.apache.maven.plugins:maven-failsafe-plugin from 3.5.5 to 3.5.6 (#2853) by dependabot[bot] · 11 days ago
  33. 2881dff Bump jackson.version from 2.21.3 to 2.21.4 (#2854) by dependabot[bot] · 11 days ago
  34. c14fbe7 Bump org.apache.maven.plugins:maven-dependency-plugin (#2855) by dependabot[bot] · 11 days ago
  35. 8c13afa Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.5 to 3.5.6 (#2856) by dependabot[bot] · 11 days ago
  36. 72f5182 Bump com.diffplug.spotless:spotless-maven-plugin from 3.5.1 to 3.6.0 (#2857) by dependabot[bot] · 11 days ago
  37. 2c97670 Bump software.amazon.awssdk:bom from 2.44.12 to 2.45.1 (#2858) by dependabot[bot] · 11 days ago
  38. 3be5f8f Bump com.nimbusds:nimbus-jose-jwt from 10.9 to 10.9.1 (#2859) by dependabot[bot] · 11 days ago
  39. 6c36ae1 TIKA-4327: update tess4j by Tilman Hausherr · 13 days ago
  40. 81d75a8 TIKA-4744 (#2850) by Tim Allison · 14 days ago
  41. 89de688 improve unpack endpoint (#2851) by Tim Allison · 14 days ago
  42. 7478ffd TIKA-4743 improve search and navigation on site (#2845) by Tim Allison · 14 days ago
  43. 499e703 TIKA-4745 - add cohort-specific caps (#2848) by Tim Allison · 2 weeks ago
  44. ba501e6 fix race condition in PipesClient (#2849) by Tim Allison · 2 weeks ago
  45. acab9b7 TIKA-4742 -- refactor logging for beta-1 (#2844) by Tim Allison · 2 weeks ago
  46. 2bab6d4 TIKA-4734 (#2843) by Tim Allison · 2 weeks ago
  47. cc3cae8 drop chunking (#2847) by Tim Allison · 2 weeks ago
  48. cabd1f2 fix potential sax dos (#2838) by Tim Allison · 2 weeks ago
  49. 4f6ad8b TIKA-4739 (#2837) by Tim Allison · 2 weeks ago
  50. a2bc351 TIKA-4731 - improve charset detection and junk detection (#2839) by Tim Allison · 2 weeks ago
  51. d02dc13 TIKA-4740 -- tika-server-core fix (#2841) by Tim Allison · 2 weeks ago
  52. 1abcd65 TIKA-4740 -- update docs by tallison · 2 weeks ago
  53. 0cbdb26 TIKA-4740 -- fix flaky windows test by tallison · 2 weeks ago
  54. 4bfbdf2 TIKA-4737 -- improve docs for tika-pipes via tika-app (#2836) by Tim Allison · 2 weeks ago
  55. cfddd1a Bump eu.maveniverse.maven.nisse:extension from 0.9.0 to 0.9.2 (#2831) by dependabot[bot] · 3 weeks ago
  56. 3b1f68e Bump org.ow2.asm:asm from 9.10 to 9.10.1 (#2830) by dependabot[bot] · 3 weeks ago
  57. 29f287f Bump org.apache.maven.plugins:maven-site-plugin from 3.21.0 to 3.22.0 (#2832) by dependabot[bot] · 3 weeks ago
  58. 3622658 Bump software.amazon.awssdk:bom from 2.44.10 to 2.44.12 (#2833) by dependabot[bot] · 3 weeks ago
  59. 1aea9db Bump org.apache.kafka:kafka-clients from 4.2.0 to 4.3.0 (#2834) by dependabot[bot] · 3 weeks ago
  60. 795f30c Bump com.microsoft.graph:microsoft-graph from 6.64.0 to 6.65.0 (#2835) by dependabot[bot] · 3 weeks ago
  61. 933fb96 Bump com.github.luben:zstd-jni from 1.5.7-8 to 1.5.7-9 (#2829) by dependabot[bot] · 3 weeks ago
  62. 8ef279d TIKA-4327: update aws, netty, woodstox, plexus by Tilman Hausherr · 3 weeks ago
  63. 4b66205 TIKA-4736 -- image extraction fails (#2828) by Tim Allison · 3 weeks ago
  64. 19b4c66 TIKA-4728 - fix xhtml in widgets (#2817) by Tim Allison · 3 weeks ago
  65. c2b15c9 TIKA-4733 -- fix docker-snapshot.yml to match new release zip artifacts (#2827) by Tim Allison · 3 weeks ago
  66. 0b38268 TIKA-4735 -- fix content-only (#2826) by Tim Allison · 3 weeks ago
  67. da1801a TIKA-4733 -- improve release artifact robustness and documentation (#2825) by Tim Allison · 3 weeks ago
  68. 52530af TIKA-4327: update junit6 by Tilman Hausherr · 3 weeks ago
  69. 230d635 TIKA-4327: update aws, jbig2 by Tilman Hausherr · 3 weeks ago
  70. 5fb9402 TIKA-4327: update aws, enforcer plugin, azure by Tilman Hausherr · 3 weeks ago
  71. a803c16 TIKA-4732 (#2820) by Tim Allison · 4 weeks ago
  72. ae9adc3 Bump eu.maveniverse.maven.nisse:extension from 0.8.4 to 0.9.0 (#2823) by dependabot[bot] · 4 weeks ago
  73. a5c6b87 Bump org.codehaus.plexus:plexus-classworlds from 2.10.0 to 2.11.0 (#2821) by dependabot[bot] · 4 weeks ago
  74. b34a066 Bump org.apache.maven:maven-model from 3.9.15 to 3.9.16 (#2822) by dependabot[bot] · 4 weeks ago
  75. 39b0795 TIKA-4327: add comment by Tilman Hausherr · 4 weeks ago
  76. 168c064 TIKA-4327: remove dependencies that are in parent by Tilman Hausherr · 4 weeks ago
  77. a15d126 TIKA-4327: update aws, junrar, swagger, spotless by Tilman Hausherr · 4 weeks ago
  78. d19d3d4 Set up default protection ruleset for default and release branches (#2819) by The Apache Software Foundation · 4 weeks ago
  79. 000c6b4 TIKA-4327: update aws, asm by Tilman Hausherr · 4 weeks ago
  80. 69c2c80 TIKA-4327: update slf4j by Tilman Hausherr · 4 weeks ago
  81. 465bc76 junk-detector-v6 (#2818) by Tim Allison · 4 weeks ago
  82. de9ea3d add tika-eval into tika-app (#2816) by Tim Allison · 4 weeks ago
  83. c136fa0 TIKA-4727 -- Small tweaks: improve embedded file name handling and add pagination in hslf (#2812) by Tim Allison · 4 weeks ago
  84. 5463cd6 TIKA-4727: improve parsemode configuration (#2815) by Tim Allison · 4 weeks ago
  85. 63cbdd7 TIKA-4727: improvements for vlm (#2814) by Tim Allison · 4 weeks ago
  86. 6af5518 improve jina-integration (#2813) by Tim Allison · 4 weeks ago
  87. 3bbf65c TIKA-4723 follow-up: fix sqlite3 shade filter and correct docs (#2810) by Nicholas DiPiazza · 4 weeks ago
  88. a9b2a86 TIKA-4723 (#2809) by Tim Allison · 4 weeks ago
  89. 0b65830 TIKA-4725 - updates to docker release workflow (#2807) by Tim Allison · 4 weeks ago
  90. cf1a656 docs/pipes-updates (#2808) by Tim Allison · 4 weeks ago
  91. c297247 [TIKA-4724] Use IANA-registered text/markdown as primary type (#2806) by Shawn Rutledge · 5 weeks ago
  92. 828e5b1 TIKA-4327: update groovy-all by Tilman Hausherr · 5 weeks ago
  93. 2c0773f TIKA-4327: revert update groovy-all by Tilman Hausherr · 5 weeks ago
  94. 93434d9 TIKA-4327: update groovy-all by Tilman Hausherr · 5 weeks ago
  95. e220372 Bump org.apache:apache from 37 to 38 (#2805) by dependabot[bot] · 5 weeks ago
  96. dc7514b TIKA-4327: update aws, log4j, microsoft-graph by Tilman Hausherr · 5 weeks ago
  97. 2ef4f7d TIKA-4327: update aws, log4j by Tilman Hausherr · 5 weeks ago
  98. b723055 Add maxPages option to PDFParserConfig to limit page processing (#2803) by Julien Nioche · 5 weeks ago
  99. 381184f TIKA-4327: update sqlite by Tilman Hausherr · 5 weeks ago
  100. 6167f10 TIKA-4327: update google-auth-library-oauth2-http, google cloud, aws by Tilman Hausherr · 5 weeks ago