Hi all,
I got past the JCE issue, but now some tests are failing with timeouts. For this test: [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest I get 100s of these warnings: Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE And then: [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 318.512 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest [ERROR] org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint Time elapsed: 308.223 s <<< ERROR! org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] at org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) Similarly, for: [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest Many of these: Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE And then similarly: [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: 309.375 s <<< FAILURE! - in org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest [ERROR] org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint Time elapsed: 307.9 s <<< ERROR! org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) Caused by: org.apache.tika.exception.TikaException: Waited more than 5 minutes for a SAXParser; This could indicate that a parser has not correctly released its SAXParser. Please report this to the Tika team: [hidden email] at org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) And now: [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils acquireSAXParser WARNING: Contention waiting for a SAXParser. Consider increasing the XMLReaderUtils.POOL_SIZE … and so on… Any suggestions? Thanks! — Ken -------------------------- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr |
Hi Ken,
just to double check, did you intend to send this mail to the tika dev list? I actually don't know what to do with your email. Best, Arvid On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[hidden email]> wrote: > Hi all, > > I got past the JCE issue, but now some tests are failing with timeouts. > > For this test: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > > I get 100s of these warnings: > > Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then: > > [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 318.512 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint > Time elapsed: 308.223 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: [hidden email] > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > [hidden email] > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) > > Similarly, for: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > > Many of these: > > Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > > And then similarly: > > [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: > 309.375 s <<< FAILURE! - in > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest > [ERROR] > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint > Time elapsed: 307.9 s <<< ERROR! > org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException > from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a > SAXParser; This could indicate that a parser has not correctly released its > SAXParser. Please report this to the Tika team: [hidden email] > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > Caused by: org.apache.tika.exception.TikaException: Waited more than 5 > minutes for a SAXParser; This could indicate that a parser has not > correctly released its SAXParser. Please report this to the Tika team: > [hidden email] > at > org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) > > And now: > > [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: > 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: > 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest > [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest > Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils > acquireSAXParser > WARNING: Contention waiting for a SAXParser. Consider increasing the > XMLReaderUtils.POOL_SIZE > … and so on… > > Any suggestions? > > Thanks! > > — Ken > > > -------------------------- > Ken Krugler > http://www.scaleunlimited.com > custom big data solutions & training > Hadoop, Cascading, Cassandra & Solr > > -- Arvid Heise | Senior Java Developer <https://www.ververica.com/> Follow us @VervericaData -- Join Flink Forward <https://flink-forward.org/> - The Apache Flink Conference Stream Processing | Event Driven | Real Time -- Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany -- Ververica GmbH Registered at Amtsgericht Charlottenburg: HRB 158244 B Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji (Toni) Cheng |
Hi Arvid (and the dev list),
Sorry - I was busy working on two open source projects at the same time, and messed up. The Tika dev list and the Flink dev list both show up as “dev” in Apple Email :( This also explains why I didn’t get a response from my fellow Tika devs... — Ken > On Nov 23, 2020, at 4:34 AM, Arvid Heise <[hidden email]> wrote: > > Hi Ken, > > just to double check, did you intend to send this mail to the tika dev > list? I actually don't know what to do with your email. > > Best, > > Arvid > > On Sat, Nov 21, 2020 at 11:43 PM Ken Krugler <[hidden email]> > wrote: > >> Hi all, >> >> I got past the JCE issue, but now some tests are failing with timeouts. >> >> For this test: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest >> >> I get 100s of these warnings: >> >> Nov 21, 2020 10:28:38 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> >> And then: >> >> [ERROR] Tests run: 87, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: >> 318.512 s <<< FAILURE! - in >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest >> [ERROR] >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint >> Time elapsed: 308.223 s <<< ERROR! >> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException >> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a >> SAXParser; This could indicate that a parser has not correctly released its >> SAXParser. Please report this to the Tika team: [hidden email] >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5 >> minutes for a SAXParser; This could indicate that a parser has not >> correctly released its SAXParser. Please report this to the Tika team: >> [hidden email] >> at >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341) >> >> Similarly, for: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest >> >> Many of these: >> >> Nov 21, 2020 10:33:55 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> >> And then similarly: >> >> [ERROR] Tests run: 24, Failures: 0, Errors: 1, Skipped: 3, Time elapsed: >> 309.375 s <<< FAILURE! - in >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest >> [ERROR] >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint >> Time elapsed: 307.9 s <<< ERROR! >> org.apache.tika.exception.TikaException: TIKA-237: Illegal SAXException >> from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@e30d60 >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a >> SAXParser; This could indicate that a parser has not correctly released its >> SAXParser. Please report this to the Tika team: [hidden email] >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5 >> minutes for a SAXParser; This could indicate that a parser has not >> correctly released its SAXParser. Please report this to the Tika team: >> [hidden email] >> at >> org.apache.tika.parser.microsoft.ooxml.SXSLFExtractorTest.testUnsupportedPowerPoint(SXSLFExtractorTest.java:281) >> >> And now: >> >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest >> [INFO] Tests run: 36, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: >> 0.832 s - in org.apache.tika.parser.microsoft.ooxml.SXWPFExtractorTest >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest >> [WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: >> 0.053 s - in org.apache.tika.parser.microsoft.ooxml.TruncatedOOXMLTest >> [INFO] Running org.apache.tika.parser.microsoft.ooxml.xps.XPSParserTest >> Nov 21, 2020 10:39:05 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> Nov 21, 2020 10:39:06 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> Nov 21, 2020 10:39:07 PM org.apache.tika.utils.XMLReaderUtils >> acquireSAXParser >> WARNING: Contention waiting for a SAXParser. Consider increasing the >> XMLReaderUtils.POOL_SIZE >> … and so on… >> >> Any suggestions? >> >> Thanks! >> >> — Ken -------------------------- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr |
Free forum by Nabble | Edit this page |