Thanks for the reply.
I've changed -SM to both (instead of meta) and this gives me a summary that's not null, but just an empty string e.g. "summary": ""
even though the search results appear relevant to the query so I imagine they're being indexed properly on update.
Below is our xml.cfg file:
PADRE XML Mapping Version: 2
t,1,,//title
document,/courses/course
docurl,/courses/course/url
L,1,,//level
S,1,,//subject
D,1,,//distancelearning
j,1,,//parttime
k,1,,//degreepreparation
m,1,,//overview
c,1,,//content
M,1,,//faculty
Notice that all of these are set to searchable content (1) now.
Below is our XML schema:
<courses>
<course>
<title>Accounting</title>
<url>
https://www...ac.uk/courses/undergraduate/accounting/
</url>
<overview>
<p>Studying accounting degree with us... blah blah blah</p>
</overview>
<content>
<p>The Development Programme is a core element... blah blah blah</p>
</content>
<level>Undergraduate</level>
<faculty>business school</faculty>
<subject>Accounting and finance</subject>
<distancelearning/>
<parttime/>
<degreepreparation>Y</degreepreparation>
</course>
<course>
<title>Business Analysis</title>
...
I'm kinda expecting something from one of these fields to be in the summary, is that right?
I'm also seeing another strange thing that I maybe ought to open a new ticket for, not sure if it's anything to do with these changes though - in the JSON, our content and overview mata data is being truncated and I'm not sure why:
{
"question": {...},
"response": {
"resultPacket": {
...
"results": [{
"rank": 1,
"score": 1000,
"title": "Art & Design",
"collection": "uos-courses-xml",
"component": 0,
"collapsed": null,
"liveUrl": "https://www...ac.uk/courses/pg/educationartdesign/",
"summary": "",
"cacheUrl": "/s/cache?collection=uos-courses-xml&doc=funnelback-web-crawl.warc&off=784948&len=7100&url=https%3A%2F%2Fwww...ac.uk%2Fcourses%2Fpostgraduatetaught%2Fsecondaryeducationartdesign%2F&profile=_default_preview",
"date": null,
"fileSize": 0,
"fileType": "txt",
"tier": 1,
"docNum": 404,
"exploreLink": null,
"kmFromOrigin": null,
"quickLinks": null,
"displayUrl": "https://www...ac.uk/courses/pg/educationartdesign/",
"clickTrackingUrl": "/s/redirect?collection=uos-courses-xml&url=https%3A%2F%2Fwww...ac.uk%2Fcourses%2Fpostgraduatetaught%2Fsecondaryeducationartdesign%2F&index_url=https%3A%2F%2Fwww...ac.uk%2Fcourses%2Fpostgraduatetaught%2Fsecondaryeducationartdesign%2F&auth=YgI0FkLuoXYncMjEfBW9cg&profile=_default_preview&rank=1&query=arts",
"explain": null,
"indexUrl": "https://www...ac.uk/courses/pg/educationartdesign/",
"gscopesSet": [],
"documentVisibleToUser": true,
"promoted": false,
"diversified": false,
"metaData": {
"c": "<p>As part of the course, you’ll have the opportunity to submit two of your assignments at Masters level. If you do this, you’ll be almost halfway towards a Masters in Education qualification. If you go on to do the Masters in Education, all",
"S": "Education,Teaching",
"k": "Y",
"L": "Postgraduate taught",
"m": "<p>As a teacher of art and design, you’ll be responsible for uncovering the creative spark in each and every student you teach. You’ll open young people’s eyes to endless creative possibilities and show them new means by which they can express"
},
"tags": [],
"customData": {}
},
{
...
The full text for these fields is present in the XML feeds, but I'm not sure why they are truncated by Funnelback. Anything obvious in my configuration that might be causing this?
Thanks