XCRI Knowledge Base

Setting standards for more effective courses information management

  • Increase font size
  • Default font size
  • Decrease font size

All About XCRI

The XCRI community blog: all about eXchanging Course Related Information (XCRI) and its Course Advertising Profile (XCRI-CAP).

Blog entries categorized under data specification

data specification

13 posts in this category
alanepaull

Typing woes

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Monday, 04 February 2013 data specification 0 Comments

How do we use XCRI-CAP to enable feed consumers to filter out the course records they want from those they don't want? A fundamental question that was asked when we first started to design XCRI back in the day. This post, and a reiteration of that question as a starting point, was stimulated by Qamar Zaman's blog post "XCRI and Qualification" (https://atiqam.wordpress.com/xcri-and-qualification/).

XCRI-CAP 1.2 has many features that permit filtering in theory. These include at course level: subject, type and credit >> level; at presentation level: age, duration, studyMode, attendanceMode, attendancePattern; at qualification level: educationLevel and type. These elements were deliberately included in order to help with filtering, both for consumption of feeds and for search.

However, XCRI-CAP is primarily a structural specification - it specifies, for example, that if you have a course title, this is where you should put it. The spec itself doesn't prescribe the content of elements, except for some suggestions (not binding) for studyMode, attendanceMode and attendancePattern. That's why we have a Data Definitions document for the Course Data Programme, and even that is to an extent loosely specified - and could do with tightening up, once we have agreement on the content. For machine-readability this is not ideal, but it has helped to enable many organisations to produce an XCRI-CAP feed, and we already have some aggregation taking place, and some services.

When XCRI-CAP was designed, there were very few generally accepted vocabularies for key information items that enable divvying up of the data. Therefore, the designers were loathe to include them in the spec, as that could easily have restricted its take-up by negating potential use cases. On the other hand, producers of XCRI-CAP feeds need to know what XCRI-CAP feed consumers want inside many of these elements, so that the data can be filtered and consumed with a minimum of unnecessary intervention. This is the reason why the emergence of communities of practice within the Course Data Programme (for example, around Graduate Prospects, and the Creative Assembly, and later UCAS) has been so encouraging and so important.

As I've mentioned, XCRI-CAP 1.2 does include several data elements that can help, if populated with agreed vocabularies. Some are fairly well specified, such as and , while others (I'm looking at in all its forms as an example) less so. We can enumerate qualification type through various well-established frameworks (for example NQF or QCF). We have JACS and other vocabularies for subjects, and we have suggested vocabularies for studyMode, attendanceMode and attendancePattern.

The purpose of the element in course is to provide a filtering mechanism not already covered by elements such as studyMode, subject, qualification or educationLevel. An archetypal "type" is 'continuing professional education' courses; these cannot be readily extracted using existing elements, because they typically carry no credit or level, you cannot pick them out with just a subject vocabulary, duration or other easy descriptor without analysing free text descriptions. It also seems to me not unreasonable that an aggregator might want to pull out CPD courses (in fact we already have two specific cases of this). This is not an isolated use case. Consider perhaps 'Open Learning' courses, or 'Continuing Education', or even 'Undergraduate' - under-specified for level in most frameworks, or 'Postgraduate Taught' and 'Postgraduate Research'. The current state of XCRI-CAP design does not permit, without more vocabularies, these groups of courses to be filtered easily.

I think we legitimately have several axes (pl. of axis and pl. of axe!) here with which to slice up course provision, independent of educationLevel, qualification abbreviation, studyMode, subject and others explicitly defined:

  • qualification type: For example - 'GCSE or Equivalent', 'Foundation Degree', 'Postgraduate Qualification' [As an aside, importantly, I note that there is an error in the Data Definitions: there *should* be an element for qualification type; it's in the schemas but not the data definitions. This may help, as there are some useful qualtype vocabularies around that don't necessarily equate simply to 'level'.]
  • course type (inter-institution context): For example 'Continuing Professional Development', 'Open Learning', 'Continuing Education', 'Undergraduate', 'Postgraduate Taught', 'Postgraduate Research', 'Summer School', 'Researcher Training'.
  • course type (structural component type within an HEIs offerings): For example - 'module', 'programme', 'stage'; as used in the HEAR XML specification
  • course type (community practice): see CPD community practice below.
  • module / programme relations: could use hasPart / isPartOf, coupled with the structural component type (again as in the HEAR XML specification)

A typical usage in the CPD community might be:

This example is already implemented in the Course Data Programme schema, and the vocabulary is published as a VDEX file here: https://xcri.co.uk/vocabularies/courseTypeCPD1_0.xml.

My view is that communities of practice will have specific requirements for these vocabularies, which will be different in different communities. For example, Graduate Prospects may well wish to split up PG courses into different types, not necessarily linked directly to educationLevel; some "course type" terms might look like educationLevel terms, but they are being used in a different course-based context. For PG courses, you might want to identify CPD, Taught, and Research courses and perhaps researcher training, and these terms might be sufficient in the element. [Bearing in mind that multiple elements are permitted, and we can use xsi:type to prescribe and validate vocabularies.]

So there are many ways to slice and dice chunks of course provision, and XCRI-CAP 1.2 has elements that can enable this. We collectively need to determine what chunks need identification over and above the reasonably well specified stuff like subject, educationLevel and study mode. We can implement multiple vocabulary elements if required - a type vocab doesn't have to have mutually exclusive terms. And in my experience starting with agreement on a small number of terms is better than trying to get to an all-encompassing vocabulary before using it.

Here's 3, for a small start.

  • 'Continuing Professional Development'
  • 'Undergraduate'
  • 'Postgraduate'

Implement with the following XML:


And also some components:

  • 'Programme'
  • 'Pathway'
  • 'Stage'
  • 'Year'
  • 'Module'

Implement with the the following XML:


These will validate against the Course Data Programme schema, but should be considered a pilot implementation.

 

Tags: Untagged
alanepaull

Perils of typing

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Friday, 30 November 2012 data specification 0 Comments
"With some trepidation" is how I started my recent email to the CourseDataStage1 mailing list, as I asked for comments on a suggestion about a vocabulary for course 'type'. We have an ongoing robust discussion.
Tags: Untagged
Hits: 12483 Continue reading →
alanepaull

The ordering of XCRI-CAP data items

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 18 July 2012 data specification 1 Comment

A question just arose from a project about the correct ordering of XCRI-CAP data items; it's required by the validator, but is not actually part of the spec. So I had a few thoughts about that.

Although ordering of data items is not part of the XCRI-CAP 1.2 specification, it *is* part of the XCRI-CAP 1.2 XML schema. I can therefore confirm that the order of the data elements in XCRI-CAP feeds is important!

The specification, as published on the wiki at xcri.org/wiki, is a formal description of XCRI-CAP data structures and data items. It can be implemented and coded in many different ways, for example through a W3C schema (as we have used in Craig's validator and published at https://xcri.co.uk/bindings/xcri_cap_1_2.xsd), through an RDF schema, through schematron or through JSON. Each of these implementation formats is generally referred to as a 'binding', and each can meet the requirements of the specification. However, as they use different implementation technologies, they are only directly inter-changeable if a developer produces something to convert one implementation format into another - in some cases there may be off-the-shelf ways of doing this, possibly.

The XCRI-CAP community has mostly used a W3C XML schema approach, and we've re-used some linked schemas (Dublin Core for example) to save re-designing common data elements, like 'title' or 'identifier'. We have also implemented the XCRI-CAP schema, so that it is compliant with the European Norm, Metadata for Learning Opportunities (MLO).

It's interesting to note that The Standard is the spec, not the schema - the schema is an implementation of The Standard. And some standards don't have bindings yet - for example, there isn't an 'official' MLO binding, although Scott and I have implemented one in order to use it with XCRI-CAP. This gives us namespace problems, because without a binding, we can't use a 'real' namespace. So, for example, re-using the JACS coding system in a transparent way is problematic - it has no official interoperable binding, so no namespace. Also some standards don't have 'official' bindings, even if they have namespaces - Dublin Core, for example, though I admit that we've tended to use the published DC schemas as if they are standards.

There are other ways of doing this - in fact, I have a different XCRI-CAP 1.2 implementation that just puts all the XCRI-CAP items in one schema, thereby avoiding any namespace difficulties. However, this is not the 'official' schema. If anyone wants that, just let me know.

Alan

Tags: Untagged
Hits: 23660 Continue reading →

Recent comment in this post Show all comments

alanepaull

Give 'em good data (or they'll just use the bad stuff)

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 04 July 2012 data specification 0 Comments
I've been looking at the Course Data Programme interim reports recently, as part of my role in the XCRI Support Project. One question that has arisen ...
Tags: Untagged
Hits: 29113 Continue reading →
alanepaull

LRMI, schema.org and XCRI-CAP

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Thursday, 14 June 2012 data specification 0 Comments
It's looking like there might be possibilities for linking the Learning Resources Metadata Initiative (LRMI) and XCRI-CAP, in some way. LRMI is part o...
Tags: Untagged
Hits: 35275 Continue reading →
alanepaull

Data visualisation and advanced MUSKETry

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 28 March 2012 data specification 0 Comments
One of Jamie Mahoney's ON Course blog posts suggested this train of thought. It was started by a quote from Edward Tufte’s 'Visual Display of Quantita...
Tags: Untagged
Hits: 12050 Continue reading →
alanepaull

Part time pre-Ellumination

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 14 March 2012 data specification 0 Comments
We have a second Elluminate session on XCRI-CAP data definitions and vocabularies this afternoon, so naturally I've been ruminating on the subject. So...
Tags: Untagged
Hits: 19209 Continue reading →
alanepaull

Round 2 of data definitions continues

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Monday, 05 March 2012 data specification 0 Comments
I've been amending v1.6 of the Data Definitions document and v2.0 of the Vocabulary Framework for a few days now. Rather more has come up than expecte...
Tags: Untagged
Hits: 30684 Continue reading →
alanepaull

BS 8581 and all that jazz

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Tuesday, 21 February 2012 data specification 0 Comments
BS 8581-1 is the catchy name for the emerging British standard that is / was XCRI-CAP. Members of BSI committee IST/43 (trips of the tongue, no?) have...
Tags: Untagged
Hits: 35383 Continue reading →
alanepaull

Vocabulary Framework

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 15 February 2012 data specification 0 Comments
I've now at last published the draft Vocabulary Framework Document for the Course Data Programme: you can get it at https://xcri.co.uk/KbLibrary/Co...
Tags: Untagged
Hits: 28087 Continue reading →
alanepaull

Straw man

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 25 January 2012 data specification 0 Comments
Well, the first public draft of the straw man data definitions document has been published (Version 1.6 of Data Definitions). It's rather longer than ...
Tags: Untagged
Hits: 22535 Continue reading →
alanepaull

Working group working

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 18 January 2012 data specification 0 Comments
The working group on data specification and vocabularies has started. Many thanks to the early birds who've already begun to comment on the initial 's...
Hits: 23012 Continue reading →
alanepaull

Welcome to the data specification and vocabularies blog!

by alanepaull
alanepaull
I am an information management consultant, running my own 'husband and wife' com
User is currently offline
on Wednesday, 18 January 2012 data specification 0 Comments
This blog section is to focus on the work we're doing on the data definitions (or specifications, I'm not sure quite which one I prefer) and vocabular...
Tags: Untagged
Hits: 42759 Continue reading →

News

Prev Next

The fifteenth issue of the Course Data Programme Stage 2... Read more

The fourteenth issue of the Course Data Programme Stage 2... Read more

The thirteenth issue of the Course Data Programme Stage 2... Read more

The twelfth issue of the Course Data Programme Stage 2... Read more

The eleventh issue of the Course Data Programme Stage 2... Read more

The KIS data has now been launched.  Data from all... Read more

The tenth issue of the Course Data Programme Stage 2... Read more

The ninth issue of the Course Data Programme Stage 2... Read more

Congratulations to all those who have been successful in their... Read more

The eigth issue of the Course Data Programme Stage 2... Read more

The seventh issue of the Course Data Programme Stage 2... Read more

The sixth issue of the Course Data Programme Stage 2... Read more

XCRI Interactive


JISC