Broken link reports

Hi,

The "Data reports" page of the Funnelback documentation (version 15.18.0) says that the reports are generated and saved in $SEARCH_HOME/admin/data_report//

Can these reports be easily shown on the server?

I tried https:///search/admin/data_report//data_report.cgi?collection=&report=overall_broken_links_list.html and it says it could not be found.

Assuming you are logged into Funnelback, the URL below should take you to your broken links breakdown:

https://{your-search-url-here}/search/admin/data_report.cgi?collection={your-collection-name-here}&report=broken_links_breakdown.html

Or you can navigate there from your administration panel: open the Analyse tab and then choose ‘View Data Reports Dashboard’.
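
If you want to generate that report link for several collections, the URL pattern above can be assembled in a few lines of Python. This is only a sketch: the host and collection name are hypothetical placeholders, and the function name is made up for illustration. You still need to be logged into the Funnelback admin interface for the link to work.

```python
from urllib.parse import urlencode


def data_report_url(search_host: str, collection: str,
                    report: str = "broken_links_breakdown.html") -> str:
    """Build a data report URL following the pattern quoted in this thread."""
    # urlencode escapes any characters that are not safe in a query string.
    query = urlencode({"collection": collection, "report": report})
    return f"https://{search_host}/search/admin/data_report.cgi?{query}"


# Hypothetical host and collection name, for illustration only.
print(data_report_url("search.example.com", "example-collection"))
```

Passing a different file name as `report` gives the link for another report, provided that report exists for the collection.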

If this doesn’t work, check that data_report=true is set in your collection.cfg.
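
If you have shell access to the server, a quick way to check that setting is a small script like the one below. It is a rough sketch that assumes collection.cfg is made up of plain key=value lines with # comments, and that the last occurrence of a key wins; neither assumption is confirmed in this thread. The function name and the sample config text are hypothetical.

```python
def data_report_enabled(cfg_text: str) -> bool:
    """Return True if the given config text sets data_report=true."""
    enabled = False
    for raw_line in cfg_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == "data_report":
            # Assumption: a later occurrence of the key overrides an earlier one.
            enabled = value.strip().lower() == "true"
    return enabled


# Hypothetical config contents, for illustration only.
sample_cfg = """
# example collection settings
data_report=true
"""
print(data_report_enabled(sample_cfg))
```

To check a real collection, read the contents of its collection.cfg into a string and pass that in instead of `sample_cfg`.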

Bear in mind that the broken link reporting is not as complete as the report from a dedicated broken link checker would be.

It reports on any broken link that Funnelback logged during its crawl, but it won’t check things that it didn’t attempt to access, such as linked resources (images/CSS) or external links that fall outside of the collection’s include/exclude rules.

Is there a way to access these reports through the Funnelback API?

What I would like to do is get these reports that are generated by Funnelback into Matrix so they can be published online.

There isn’t a way to access these via the API.

It is also worth keeping in mind that the broken link reporting is only of limited use and isn’t intended to be a comprehensive broken link checking solution. It only reports on broken links that Funnelback attempted to fetch during a crawl, so links that fall outside the include/exclude rules, internal links to unwanted file types (such as JS, CSS, images) and other rejected URLs (e.g. items larger than the max download size) won’t be checked at all.
