I am encountering an issue where the content auditor is not displaying the reading grade report.
I suspect this is because this collection indexes only documents (PDF, DOCX, XLSX).
Is the reading grade report generated only for web pages, or is something incorrectly configured in this collection?
Hi @vmanera -
It sounds like you’ve been working with a collection initially created in an earlier version of Funnelback. In v15.10, several new metrics were added to the Content Auditor, requiring a change to an existing collection’s filter classes to detect misspellings, reading levels, and the like.
The default values for new collections for filter.classes in collection.cfg in v15.8 are shown at:
Any filtered document will be subjected to the filters in this chain, including DOC(X), PDF, and XLS(X) files. Note that you will need to perform a full collection update for any filter configuration changes to take effect.
Cheers Gordon, that was exactly the fix. In this instance I had to ensure that JSoupProcessingFilterProvider was added to the chain of filters, i.e.:
filter.classes=FAChecker:CombinerFilterProvider,TikaFilterProvider,ExternalFilterProvider:JSoupProcessingFilterProvider:DocumentFixerFilterProvider
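For anyone else checking an older collection, here is a minimal Python sketch that tests whether a filter.classes line already includes a given filter. It is not a Funnelback tool. The helper names (filter_chain_steps, has_filter) are made up for illustration, and it assumes that ':' separates the steps of the chain and ',' separates alternative filters within one step.

```python
def filter_chain_steps(cfg_line):
    """Split a filter.classes line into steps, each a list of filter names.

    Assumption: ':' separates chain steps and ',' separates alternatives
    within a step. Both helper names here are hypothetical.
    """
    key, _, value = cfg_line.partition("=")
    if key.strip() != "filter.classes":
        raise ValueError("not a filter.classes line")
    return [
        [name.strip() for name in step.split(",")]
        for step in value.strip().split(":")
    ]


def has_filter(cfg_line, filter_name):
    """Return True if filter_name appears anywhere in the chain."""
    return any(filter_name in step for step in filter_chain_steps(cfg_line))


# The working value from this thread.
line = (
    "filter.classes=FAChecker:CombinerFilterProvider,TikaFilterProvider,"
    "ExternalFilterProvider:JSoupProcessingFilterProvider:"
    "DocumentFixerFilterProvider"
)
print(has_filter(line, "JSoupProcessingFilterProvider"))
```

If this prints False for your collection's line, adding JSoupProcessingFilterProvider to the chain and running a full update is the change that fixed it for me.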