1. 1cafe16 TIKA-4327: update grpc, bind, animal-sniffer-annotations by Tilman Hausherr · 3 hours ago main
  2. 570a3b4 TIKA-4650-refactor-zip-parser (#2584) by Tim Allison · 16 hours ago
  3. 7d7cdb5 TIKA-4651 -- refactor cli to us pipes for parser (#2586) by Tim Allison · 24 hours ago
  4. 9f7e4dc TIKA-4630 -- use embedded stored filename as the "resourcename" in gz (#2582) by Tim Allison · 3 days ago
  5. 59af7f3 TIKA-4648 -- add standard mvn repo and general ASF repo items (#2580) by Tim Allison · 3 days ago
  6. 2b9dc0e TIKA-4617 - really, I mean it, don't change the file name (#2581) by Tim Allison · 3 days ago
  7. d8ee89b TIKA-4647 - use an argfile to launch PipesServer (#2579) by Tim Allison · 3 days ago
  8. bef2d33 TIKA-4646 -- extract hyperlinks from instrText and other areas in ooxml(#2578) by Tim Allison · 3 days ago
  9. 1c06d30 TIKA-4645-usability-scripts and bug fixes (#2577) by Tim Allison · 4 days ago
  10. 46b002d TIKA-4645 - part 2, general updates for alpha release -- update docs by tallison · 4 days ago
  11. d911051 Bump com.google.cloud:google-cloud-storage from 2.62.0 to 2.62.1 (#2568) by dependabot[bot] · 4 days ago
  12. f650583 Bump org.pf4j:pf4j from 3.14.1 to 3.15.0 (#2573) by dependabot[bot] · 4 days ago
  13. f91ec64 Bump sis.version from 1.5 to 1.6 (#2569) by dependabot[bot] · 4 days ago
  14. 16d2c76 Bump org.apache.maven.plugins:maven-compiler-plugin (#2572) by dependabot[bot] · 4 days ago
  15. 9edac07 Bump com.diffplug.spotless:spotless-maven-plugin from 3.2.0 to 3.2.1 (#2570) by dependabot[bot] · 4 days ago
  16. b8f4b1a Bump software.amazon.awssdk:bom from 2.41.14 to 2.41.19 (#2571) by dependabot[bot] · 4 days ago
  17. 26fc000 Bump commons-codec:commons-codec from 1.20.0 to 1.21.0 (#2574) by dependabot[bot] · 4 days ago
  18. a399aa0 Bump com.azure:azure-sdk-bom from 1.3.3 to 1.3.4 (#2576) by dependabot[bot] · 4 days ago
  19. ec02aeb TIKA-4641 -- step 2: refactor serialization, further. add docs (#2567) by Tim Allison · 5 days ago
  20. 778acde TIKA-4643 -- add frictionless by tallison · 5 days ago
  21. acca2fe TIKA-4637 (#2565) add UNPACK option for tika-pipes and integrate it in tika-app and tika-server by Tim Allison · 5 days ago
  22. 86857ce TIKA-4644 - improve config endpoints (#2566) by Tim Allison · 5 days ago
  23. 8e69493 TIKA-4642 - improve tls configuration and documentation (#2564) by Tim Allison · 6 days ago
  24. dcb1ca0 tika-server-simplify-tests (#2563) by Tim Allison · 6 days ago
  25. cd26547 TIKA-4641 (#2562) by Tim Allison · 7 days ago
  26. 00954ff TIKA-4640 -- use ephemeral port for unit tests in tika-server (#2560) by Tim Allison · 7 days ago
  27. f73ea2f TIKA-4639 (#2559) by Tim Allison · 8 days ago
  28. 589d1c2 TIKA-4636-simplify-embedded-extractor-handling (#2558) by Tim Allison · 8 days ago
  29. 222e085 TIKA-4638 -- unify sax style configuration (#2557) by Tim Allison · 8 days ago
  30. 48ca355 TIKA-4633 centralize limits (#2556) by Tim Allison · 9 days ago
  31. bd15136 TIKA-4635 -- refactor DigesterFactory to be standalone (#2555) by Tim Allison · 9 days ago
  32. 766cf2c TIKA-4634 -- refactor metadata write filter/limiter (#2554) by Tim Allison · 9 days ago
  33. bb785a2 Bump com.diffplug.spotless:spotless-maven-plugin from 3.1.0 to 3.2.0 (#2552) by dependabot[bot] · 11 days ago
  34. 7263844 TIKA-4327: update aws, google auth, google http by Tilman Hausherr · 13 days ago
  35. 33e1f6a rm asciidoc plugin and resources by tallison · 14 days ago
  36. d0f76be Merge remote-tracking branch 'origin/main' by tallison · 14 days ago
  37. bd1c7ed TIKA-4630-on-main (#2551) by Tim Allison · 14 days ago
  38. 1c078b4 TIKA-4632 -- initial antora integration (#2550) by Tim Allison · 14 days ago
  39. 30b5b26 TIKA-4628 -- improve pipesClient+pipesServer ipc: critical socket.setTcpNoDelay(true) and migrate to pure jackson serialization (#2546) by Tim Allison · 14 days ago
  40. d7781ec TIKA-4630 -- improve tracking of internal paths (#2548) by Tim Allison · 14 days ago
  41. 231ac69 TIKA-4625: Add AsciiDoc documentation module (#2536) by Tim Allison · 14 days ago
  42. 19eb316 TIKA-4631 -- add a detect/no-parse option to pipes (#2549) by Tim Allison · 2 weeks ago
  43. 5e21b45 TIKA-4626 (#2545) by Tim Allison · 2 weeks ago
  44. 1c2b1eb Bump reactor.netty.version from 1.3.1 to 1.3.2 (#2538) by dependabot[bot] · 3 weeks ago
  45. 3f8577e Bump com.google.cloud:google-cloud-storage from 2.61.0 to 2.62.0 (#2542) by dependabot[bot] · 3 weeks ago
  46. f1489ec Bump org.netpreserve:jwarc from 0.33.0 to 0.34.0 (#2537) by dependabot[bot] · 3 weeks ago
  47. dabb1f1 Bump com.fasterxml.jackson:jackson-bom from 2.20.1 to 2.21.0 (#2544) by dependabot[bot] · 3 weeks ago
  48. 000e3df Bump io.projectreactor:reactor-core from 3.8.1 to 3.8.2 (#2540) by dependabot[bot] · 3 weeks ago
  49. 4b4ebbd Bump org.codehaus.mojo:versions-maven-plugin from 2.20.1 to 2.21.0 (#2541) by dependabot[bot] · 3 weeks ago
  50. 4190d7d Bump software.amazon.awssdk:bom from 2.41.5 to 2.41.10 (#2543) by dependabot[bot] · 3 weeks ago
  51. 53b0cdb Bump org.springframework:spring-context from 7.0.2 to 7.0.3 (#2539) by dependabot[bot] · 3 weeks ago
  52. 066412e WIP: Checkpoint - CachingSource metadata update and cleanup (#2535) by Tim Allison · 3 weeks ago
  53. 5f9a808 TIKA-4623 -- for general updates, don't buffer unless enableRewind has been set (#2534) by Tim Allison · 3 weeks ago
  54. caefbbf add rat check to the primary workflow by tallison · 3 weeks ago
  55. 5762c59 fix rat by tallison · 3 weeks ago
  56. a34d52d TIKA-4618 -- improve spooling strategy configuration (#2533) by Tim Allison · 3 weeks ago
  57. bc93db3 TIKA-4619 (#2531) by Tim Allison · 3 weeks ago
  58. c72dbc9 improve resource clean up -- directories from PipesClient (#2532) by Tim Allison · 3 weeks ago
  59. 9751122 TIKA-4622: Add test for PDF annotations without page content streams (#2530) by Tilman Hausherr · 3 weeks ago
  60. c43f000 TIKA-4616 -- mv fetcher/emitter CRUD to tika-grpc (#2519) by Tim Allison · 3 weeks ago
  61. 48f5991 TIKA-4621: avoid NPE (#2528) by Tilman Hausherr · 3 weeks ago
  62. b8ae8ab TIKA-4327: update apache parent (#2527) by Tilman Hausherr · 3 weeks ago
  63. 786d682 TIKA-4620: avoid NPE (#2526) by Tilman Hausherr · 3 weeks ago
  64. 2f301ac Bump software.amazon.awssdk:bom from 2.41.4 to 2.41.5 (#2521) by dependabot[bot] · 4 weeks ago
  65. 9b26790 Bump com.microsoft.graph:microsoft-graph from 6.59.0 to 6.60.0 (#2523) by dependabot[bot] · 4 weeks ago
  66. 029fb68 Bump com.google.errorprone:error_prone_annotations from 2.45.0 to 2.46.0 (#2524) by dependabot[bot] · 4 weeks ago
  67. 1b87c57 Bump com.nimbusds:nimbus-jose-jwt from 10.6 to 10.7 (#2525) by dependabot[bot] · 4 weeks ago
  68. 408c26e TIKA-4612 -- improve mp3 and aac detection (#2520) by Tim Allison · 4 weeks ago
  69. 3f15e82 TIKA-4615 -- rm junit 4 and update testcontainers (#2517) by Tim Allison · 4 weeks ago
  70. 23d2109 TIKA-4327: update aws, jackrabbit, pf4j by Tilman Hausherr · 4 weeks ago
  71. 2f7b46d TIKA-4614: add Media Management metadata extraction, avoid NPE, add test by Tilman Hausherr · 4 weeks ago
  72. 64b411d TIKA-4613 -- look for jsonconfig constructor, fall back to no-arg (#2516) by Tim Allison · 4 weeks ago
  73. 697d7c0 TIKA-4327: update aws, apache parent by Tilman Hausherr · 4 weeks ago
  74. 248fee1 TIKA-4605: move grpc version to parent by Tilman Hausherr · 5 weeks ago
  75. 1eb469a Bump io.grpc:grpc-context from 1.69.0 to 1.78.0 (#2513) by dependabot[bot] · 5 weeks ago
  76. c9a866d Bump com.google.j2objc:j2objc-annotations from 3.0.0 to 3.1 (#2514) by dependabot[bot] · 5 weeks ago
  77. b2e0dd5 Bump google-http-client.version from 2.0.0 to 2.0.3 (#2512) by dependabot[bot] · 5 weeks ago
  78. 31a9344 TIKA-4327: update aws, jsoup, puppycrawl by Tilman Hausherr · 5 weeks ago
  79. 20f92e9 TIKA-4582 (#2467) by Tim Allison · 5 weeks ago
  80. 4ee9e39 TIKA-4610 -- modernize rat exclusions and add exclusion for README.md (#2511) by Tim Allison · 5 weeks ago
  81. e284817 TIKA-4609: Fix Maven verbosity flags (remove line breaks) (#2510) by Nicholas DiPiazza · 5 weeks ago
  82. 9315d52 Revert "TIKA-4609: Reduce Maven verbosity in GitHub Actions workflows" (#2509) by Nicholas DiPiazza · 5 weeks ago
  83. 218508d TIKA-4609: Reduce Maven verbosity in GitHub Actions workflows by Nicholas DiPiazza · 5 weeks ago
  84. 7109dcf TIKA-4608 -- clean up metadata filter api (#2507) by Tim Allison · 5 weeks ago
  85. 0871544 TIKA-4607 - rm DigestingParser from 4.x (#2506) by Tim Allison · 5 weeks ago
  86. c9d1ec8 TIKA-4581 - rm metadata filter where it isn't needed any more (#2468) by Tim Allison · 5 weeks ago
  87. 722ba62 try to get jenkins builds working again by tallison · 5 weeks ago
  88. d091813 try to fix memory pressure complaints in opensearch integration tests by tallison · 5 weeks ago
  89. 1315494 TIKA-4605: Add Google Drive fetcher plugin (#2504) by Nicholas DiPiazza · 6 weeks ago
  90. 691e304 Bump org.apache.logging.log4j:log4j-core in /tika-e2e-tests (#2501) by dependabot[bot] · 6 weeks ago
  91. d5918e6 TIKA-4598: Move tika-pipes-ignite from plugin to standalone module (#2499) by Nicholas DiPiazza · 6 weeks ago
  92. 88e4350 TIKA-4604: Add Atlassian JWT fetcher plugin (#2502) by Nicholas DiPiazza · 6 weeks ago
  93. b5aaa89 TIKA-4600: Add E2E tests for tika-grpc (#2500) by Nicholas DiPiazza · 6 weeks ago
  94. 0b753f5 TIKA-4327: revert mistake by Tilman Hausherr · 6 weeks ago
  95. 69f13ba TIKA-4327: update grpc by Tilman Hausherr · 6 weeks ago
  96. 9ccf59e TIKA-4585 -- simplify serialization (#2471) by Tim Allison · 6 weeks ago
  97. 34b60d6 TIKA-4595: Dynamic Fetcher/Emitter Management with ConfigStore Support (#2489) by Nicholas DiPiazza · 6 weeks ago
  98. 771e649 Bump org.netpreserve:jwarc from 0.32.0 to 0.33.0 (#2495) by dependabot[bot] · 6 weeks ago
  99. fd14acf Bump software.amazon.awssdk:bom from 2.40.13 to 2.40.16 (#2494) by dependabot[bot] · 6 weeks ago
  100. 1c7f572 Bump twelvemonkeys.version from 3.12.0 to 3.13.0 (#2490) by dependabot[bot] · 6 weeks ago