Exclude index page

I need to exclude the index page or homepage from the search results. I cannot add the index link in the exclude content from, as it will exclude the entire collection.

Has anyone got any suggestions of how to achieve this.

Thanks

Hi Tara,

There are two ways to do this.

  1. you can add a robots meta tag to your index page <meta name="robots" content="noindex,follow" />. This will instruct all web crawlers to follow the links on the page but not index the page itself.

  2. You can create a kill_exact.cfg file that contains the URL of your homepage. Funnelback will then exclude this page from the index at index time.

Details on kill_exact.cfg can be found here:

To create the file you need to log in to the administration interface, switch to your collection then access the file manager (browse collection configuration files, from the administer tab)

Then you create a kill_exact.cfg using the create dropdown and add the URL of the page to kill to the file. You can apply the kill straight away by reindexing your site - run an advanced update / reindex the live view from the update tab in the administration interface.

It’s also possible to kill entire sections using the kill_partial.cfg - this removes all pages starting with a common URL prefix from the index.

1 Like

In practice this would look like:

<meta name="robots" content="follow,noindex"/>

The downside of this approach is that all well-behaved robots would obey that directive, not just Funnelback.

See also:

Thanks Gordon - I didn’t notice that the markdown editor hid the meta tag because it was html.