- 8ef279d TIKA-4327: update aws, netty, woodstox, plexus by Tilman Hausherr · 2 days ago main
- 4b66205 TIKA-4736 -- image extraction fails (#2828) by Tim Allison · 2 days ago
- 19b4c66 TIKA-4728 - fix xhtml in widgets (#2817) by Tim Allison · 3 days ago
- c2b15c9 TIKA-4733 -- fix docker-snapshot.yml to match new release zip artifacts (#2827) by Tim Allison · 3 days ago
- 0b38268 TIKA-4735 -- fix content-only (#2826) by Tim Allison · 3 days ago
- da1801a TIKA-4733 -- improve release artifact robustness and documentation (#2825) by Tim Allison · 3 days ago
- 52530af TIKA-4327: update junit6 by Tilman Hausherr · 4 days ago
- 230d635 TIKA-4327: update aws, jbig2 by Tilman Hausherr · 4 days ago
- 5fb9402 TIKA-4327: update aws, enforcer plugin, azure by Tilman Hausherr · 5 days ago
- a803c16 TIKA-4732 (#2820) by Tim Allison · 6 days ago
- ae9adc3 Bump eu.maveniverse.maven.nisse:extension from 0.8.4 to 0.9.0 (#2823) by dependabot[bot] · 6 days ago
- a5c6b87 Bump org.codehaus.plexus:plexus-classworlds from 2.10.0 to 2.11.0 (#2821) by dependabot[bot] · 6 days ago
- b34a066 Bump org.apache.maven:maven-model from 3.9.15 to 3.9.16 (#2822) by dependabot[bot] · 6 days ago
- 39b0795 TIKA-4327: add comment by Tilman Hausherr · 7 days ago
- 168c064 TIKA-4327: remove dependencies that are in parent by Tilman Hausherr · 7 days ago
- a15d126 TIKA-4327: update aws, junrar, swagger, spotless by Tilman Hausherr · 8 days ago
- d19d3d4 Set up default protection ruleset for default and release branches (#2819) by The Apache Software Foundation · 8 days ago
- 000c6b4 TIKA-4327: update aws, asm by Tilman Hausherr · 8 days ago
- 69c2c80 TIKA-4327: update slf4j by Tilman Hausherr · 9 days ago
- 465bc76 junk-detector-v6 (#2818) by Tim Allison · 9 days ago
- de9ea3d add tika-eval into tika-app (#2816) by Tim Allison · 10 days ago
- c136fa0 TIKA-4727 -- Small tweaks: improve embedded file name handling and add pagination in hslf (#2812) by Tim Allison · 10 days ago
- 5463cd6 TIKA-4727: improve parsemode configuration (#2815) by Tim Allison · 10 days ago
- 63cbdd7 TIKA-4727: improvements for vlm (#2814) by Tim Allison · 10 days ago
- 6af5518 improve jina-integration (#2813) by Tim Allison · 10 days ago
- 3bbf65c TIKA-4723 follow-up: fix sqlite3 shade filter and correct docs (#2810) by Nicholas DiPiazza · 11 days ago
- a9b2a86 TIKA-4723 (#2809) by Tim Allison · 12 days ago
- 0b65830 TIKA-4725 - updates to docker release workflow (#2807) by Tim Allison · 12 days ago
- cf1a656 docs/pipes-updates (#2808) by Tim Allison · 12 days ago
- c297247 [TIKA-4724] Use IANA-registered text/markdown as primary type (#2806) by Shawn Rutledge · 13 days ago
- 828e5b1 TIKA-4327: update groovy-all by Tilman Hausherr · 13 days ago
- 2c0773f TIKA-4327: revert update groovy-all by Tilman Hausherr · 13 days ago
- 93434d9 TIKA-4327: update groovy-all by Tilman Hausherr · 13 days ago
- e220372 Bump org.apache:apache from 37 to 38 (#2805) by dependabot[bot] · 13 days ago
- dc7514b TIKA-4327: update aws, log4j, microsoft-graph by Tilman Hausherr · 2 weeks ago
- 2ef4f7d TIKA-4327: update aws, log4j by Tilman Hausherr · 2 weeks ago
- b723055 Add maxPages option to PDFParserConfig to limit page processing (#2803) by Julien Nioche · 2 weeks ago
- 381184f TIKA-4327: update sqlite by Tilman Hausherr · 2 weeks ago
- 6167f10 TIKA-4327: update google-auth-library-oauth2-http, google cloud, aws by Tilman Hausherr · 2 weeks ago
- fe023c3 TIKA-4327: add header by Tilman Hausherr · 2 weeks ago
- 2a4353e TIKA-4327: update aws, jaxb, netty by Tilman Hausherr · 3 weeks ago
- 9ac7129 update-4x-docs (#2802) by Tim Allison · 3 weeks ago
- 315704a Merge branch 'main' of https://gitbox.apache.org/repos/asf/tika by Tilman Hausherr · 3 weeks ago
- 0fefbe6 TIKA-4327: update google api services by Tilman Hausherr · 3 weeks ago
- 446bc14 [maven-release-plugin] prepare for next development iteration by tallison · 3 weeks ago
- b077f09 [maven-release-plugin] prepare release 4.0.0-alpha-1-rc1 by tallison · 3 weeks ago 4.0.0-alpha-1
- 56f7965 prep for 4.0.0-alpha-1 fix scm by tallison · 3 weeks ago
- 9c034c1 [maven-release-plugin] rollback the release of 4.0.0-alpha-1-rc1 by tallison · 3 weeks ago
- 07ef90e [maven-release-plugin] prepare release 4.0.0-alpha-1-rc1 by tallison · 3 weeks ago
- 50e559f prep for 4.0.0-alpha-1 release -update unit test by tallison · 3 weeks ago
- b4551e7 prep for 4.0.0-alpha-1 release by tallison · 3 weeks ago
- b683355 TIKA-4327: Replace hardcoded Micronaut version with variable (#2801) by dependabot[bot] · 3 weeks ago
- 36211dd TIKA-4683 -- hyphens in ooxml (#2799) by Tim Allison · 3 weeks ago
- ddaebd0 TIKA-4683 -- fix ole digesting (#2798) by Tim Allison · 3 weeks ago
- 4b78311 TIKA-4683 -- charset detector dep mgmt and order in AutoDetectReader (#2800) by Tim Allison · 3 weeks ago
- d90734b TIKA-4327: update aws, swagger-annotations, mime4j, opennlp, zstd, joda-time by Tilman Hausherr · 3 weeks ago
- e8c36c9 TIKA-4722: Add parse_context_json field to FetchAndParseRequest for per-request ParseContext configuration (#2797) by Nicholas DiPiazza · 3 weeks ago
- 0300d94 TIKA-4327: update aws, jackson, grpc, micronaut by Tilman Hausherr · 3 weeks ago
- 6c78186 TIKA-4683-rollback-encoding-detection (#2796) by Tim Allison · 3 weeks ago
- aeea39e improve epub handling of truncated files (#2795) by Tim Allison · 3 weeks ago
- 66a83d3 charset and junk tweaks (#2794) by Tim Allison · 3 weeks ago
- 6b53816 Automatically add slices and/or log underprovisioned pipes configurations (#2793) by Tim Allison · 4 weeks ago
- 4e0340b TIKA-4721: Fix TOCTOU race in SharedServerManager port assignment (#2791) by Nicholas DiPiazza · 4 weeks ago
- c9da280 switch to OK... we're not actually testing anything with SLOW (#2788) by Tim Allison · 4 weeks ago
- b6bcce6 TIKA-4327: update open-nlp by Tilman Hausherr · 4 weeks ago
- 0eac3d3 TIKA-4703: Fix tika-grpc container permissions for plugin directory (#2792) by Nicholas DiPiazza · 4 weeks ago
- bcfc0ea TIKA-4703: Fix tika-grpc Docker image missing runtime dependencies (#2790) by Nicholas DiPiazza · 4 weeks ago
- cb20b84 Bump org.jetbrains.kotlin:kotlin-stdlib from 2.3.20 to 2.3.21 (#2789) by dependabot[bot] · 4 weeks ago
- 3ad2e84 TIKA-4327: remove unneeded dependency by Tilman Hausherr · 4 weeks ago
- 9eb4f59 TIKA-4327: add comment about mchange-commons-java by Tilman Hausherr · 4 weeks ago
- da79f42 TIKA-4327: update gson by Tilman Hausherr · 4 weeks ago
- 0c99d53 TIKA-4720-wiring (#2787) by Tim Allison · 4 weeks ago
- dc9f770 TIKA-4695: revert gson, mchange due to test failure by Tilman Hausherr · 4 weeks ago
- 2ebdc22 TIKA-4327: update aws, commons-codec, c3p0, gson, mchange by Tilman Hausherr · 4 weeks ago
- e63170f TIKA-4720 -- Move charset detection to byte-bigram Naive Bayes pipeline (#2784) by Tim Allison · 4 weeks ago
- 7d34f9e TIKA-4719 -- Universalish junk detector (#2783) by Tim Allison · 4 weeks ago
- 638cbb2 TIKA-4327: update commons-io by Tilman Hausherr · 4 weeks ago
- 5dfa028 improve legacy charset detector to benefit from features of StandardHtmlEncodingDetector (#2786) by Tim Allison · 4 weeks ago
- e0d4a6d remove bestMatch (#2785) by Tim Allison · 4 weeks ago
- 4a43c2c TIKA-4703: Fix chmod failure in tika-grpc Dockerfile on CI (#2782) by Nicholas DiPiazza · 4 weeks ago
- dc99fff TIKA-4703: Fix Docker Hub secret name DOCKERHUB_USERNAME -> DOCKERHUB_USER (#2781) by Nicholas DiPiazza · 4 weeks ago
- c0d5296 TIKA-4703: Upgrade GitHub Actions to Node.js 24 compatible versions (#2780) by Nicholas DiPiazza · 4 weeks ago
- 0ae889f TIKA-4703: Pin docker/* actions to SHA digests per ASF policy (INFRA-27837) (#2779) by Nicholas DiPiazza · 5 weeks ago
- fd16980 TIKA-4327: update lombok by Tilman Hausherr · 5 weeks ago
- 65e8193 TIKA-4327: update aws, oak by Tilman Hausherr · 5 weeks ago
- 2260a19 TIKA-4703: Add Docker CI pipelines for tika-server and tika-grpc (#2715) by Nicholas DiPiazza · 5 weeks ago
- 306c79c TIKA-4327: update activation, jsoup, testcontainers, angus by Tilman Hausherr · 5 weeks ago
- 7a039f3 Add OCR encode parser module (#2769) by Cristian Zamfir · 5 weeks ago
- c12494c Bump eu.maveniverse.maven.nisse:extension from 0.8.3 to 0.8.4 (#2777) by dependabot[bot] · 5 weeks ago
- f5a4b55 Bump org.apache.maven:maven-model from 3.9.14 to 3.9.15 (#2778) by dependabot[bot] · 5 weeks ago
- c79648b TIKA-4327: update junrar, aws, google cloud, guava, reactor, spring, sqlite, swagger-annotations, oauth2, microsoft-graph by Tilman Hausherr · 5 weeks ago
- 9ecd958 charset-ship-today (#2776) by Tim Allison · 5 weeks ago
- 619077d 4x-reg-sax-fixes (#2773) by Tim Allison · 5 weeks ago
- 5bd8fbb update epub along the lines of oodt (#2774) by Tim Allison · 5 weeks ago
- 48e4ecc strip charset from mimes (#2775) by Tim Allison · 5 weeks ago
- 35ebe92 fix 'occured' -> 'occurred' in JoshuaNetworkTranslator log (#2772) by Sai Asish Y · 5 weeks ago
- 5396175 TIKA-4715 - try to fix osgi integration tests (#2758) by Tim Allison · 5 weeks ago
- 07e08fb clean up dwg parsing (#2770) by Tim Allison · 5 weeks ago
- c38a475 Merge remote-tracking branch 'origin/main' into 4x-reg-general-tweaks by tallison · 5 weeks ago
- b6c85ee Bump bouncycastle from 1.83 to 1.84 (#2771) by David Frizelle · 5 weeks ago