If you're concerned about both indexing AND display, you may want to try using the InjectNoIndexFilter as part of your filter chain:
https://docs.funnelback.com/14.2/filter_classes_collection_cfg.html#InjectNoIndexFilterProvider
Provided you add the chain after a file has been filtered from binary content to text/html, the InjectNoIndexFilter configuration would then need to be fairly inclusive:
# collection.cfg
...
# Add InjectNoIndex to Filter Chain
filter.classes=CombinerFilterProvider,TikaFilterProvider,ExternalFilterProvider:DocumentFixerFilterProvider:InjectNoIndexFilterProvider
# Configure InjectNoIndexFilterProvider to exclude all content
filter.noindex.1=.* head,body
# Consider domain and path portions of URL as metadata at query-time
query_processor_options=-uv
You may not be able to get around indexing the <title> tag (extracted from embedded metadata), but you could choose to display the liveUrl instead of the result's title at template time. If you're only interested in display and don't care about indexing, a template-only fix would be appropriate.