More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Ken Krugler
Hi all,

I got past the JCE issue, but now some tests are failing with timeouts.

For this test:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest

I get 100s of these warnings:

Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then:

[ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 318.512 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint  Time elapsed: 308.223 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email]
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email]
        at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)

Similarly, for:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest

Many of these:

Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE

And then similarly:

[ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 309.375 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
[ERROR] org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint  Time elapsed: 307.9 s  <<< ERROR!
org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email]
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email]
        at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)

And now:

[INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
[INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser
WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE
… and so on…

Any suggestions?

Thanks!

— Ken


--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr

Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Arvid Heise-3
Hi Ken,

just to double check, did you intend to send this mail to the tika dev
list? I actually don't know what to do with your email.

Best,

Arvid

On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[hidden email]>
wrote:

> Hi all,
>
> I got past the JCE issue, but now some tests are failing with timeouts.
>
> For this test:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>
> I get 100s of these warnings:
>
> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then:
>
> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 318.512 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
> Time elapsed: 308.223 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email]
>         at
> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>
> Similarly, for:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>
> Many of these:
>
> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
>
> And then similarly:
>
> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
> 309.375 s <<< FAILURE! - in
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
> [ERROR]
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
> Time elapsed: 307.9 s  <<< ERROR!
> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> SAXParser; This could indicate that a parser has not correctly released its
> SAXParser. Please report this to the Tika team: [hidden email]
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> minutes for a SAXParser; This could indicate that a parser has not
> correctly released its SAXParser. Please report this to the Tika team:
> [hidden email]
>         at
> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>
> And now:
>
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
> acquireSAXParser
> WARNING: Contention waiting for a SAXParser. Consider increasing the
> XMLReaderUtils.POOL_SIZE
> … and so on…
>
> Any suggestions?
>
> Thanks!
>
> — Ken
>
>
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>

--

Arvid Heise | Senior Java Developer

<https://www.ververica.com/>

Follow us @VervericaData

--

Join Flink Forward <https://flink-forward.org/> - The Apache Flink
Conference

Stream Processing | Event Driven | Real Time

--

Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany

--
Ververica GmbH
Registered at Amtsgericht Charlottenburg: HRB 158244 B
Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji
(Toni) Cheng
Reply | Threaded
Open this post in threaded view
|

Re: More issues with top-level build for Tika 1.25 rc1 - Waited more than 5 minutes for a SAXParser

Ken Krugler
Hi Arvid (and the dev list),

Sorry - I was busy working on two open source projects at the same time, and messed up. The Tika dev list and the Flink dev list both show up as “dev” in Apple Email :(

This also explains why I didn’t get a response from my fellow Tika devs...

— Ken


> On Nov 23, 2020, at 4:34 AM, Arvid Heise <[hidden email]> wrote:
>
> Hi Ken,
>
> just to double check, did you intend to send this mail to the tika dev
> list? I actually don't know what to do with your email.
>
> Best,
>
> Arvid
>
> On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[hidden email]>
> wrote:
>
>> Hi all,
>>
>> I got past the JCE issue, but now some tests are failing with timeouts.
>>
>> For this test:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>>
>> I get 100s of these warnings:
>>
>> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then:
>>
>> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 318.512 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint
>> Time elapsed: 308.223 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email]
>>        at
>> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
>>
>> Similarly, for:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>>
>> Many of these:
>>
>> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>>
>> And then similarly:
>>
>> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed:
>> 309.375 s <<< FAILURE! - in
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest
>> [ERROR]
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint
>> Time elapsed: 307.9 s  <<< ERROR!
>> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException
>> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
>> SAXParser; This could indicate that a parser has not correctly released its
>> SAXParser. Please report this to the Tika team: [hidden email]
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
>> minutes for a SAXParser; This could indicate that a parser has not
>> correctly released its SAXParser. Please report this to the Tika team:
>> [hidden email]
>>        at
>> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281)
>>
>> And now:
>>
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
>> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed:
>> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest
>> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest
>> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils
>> acquireSAXParser
>> WARNING: Contention waiting for a SAXParser. Consider increasing the
>> XMLReaderUtils.POOL_SIZE
>> … and so on…
>>
>> Any suggestions?
>>
>> Thanks!
>>
>> — Ken

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr