Re: Standardized documentation metadata

Pete Travis <me@xxxxxxxxxxxxxx> · Mon, 13 Apr 2015 17:53:50 -0600

On 03/25/2015 11:28 PM, Pete Travis wrote:
> Hi all,
>
> I'm working through the logistics of consuming arbitrary markup
> formats.  It would be helpful to have a standardized way to describe a
> document.  The example below is YAML, because it is easy to digest,
> write, and read, and I think the structure covers the important
> attributes without being cumbersome.
>
> ---
> title: Example Document
> stub: A representation of a document description standard.
> abstract: >
>     This could be an example document.
>     It could explain example documents.
>     It could use examples to explain what a document is.
>     It is none of these.
> author:
> - Carlton Bentleby
> - Vizek Glibensky
> - Obvious Pseudonym
> taxonomy:
>     theme: Contributions
>     scope: Documentation
>     track: Metadata
>     project: anerist
> tags:
> - yaml
> - metadata
> - proposal
> - standard
> ---
>
> title, stub, and abstract are self-apparent. 
>
> Author seems like a MAY item;  for a standard to go further than our
> group, it'll need that, but staking a line around a collaborative work
> MAY discourage collaboration.
>
> The taxonomy can be in any order, but in function it's hierarchical;
> theme at a very abstract level, scope being more refined, track gets
> down to a specific topic.  The project differentiates between different
> approaches to addressing a given topic (and hey, people _will_ look for
> answers on a specific solution).
>
> Tags give some SEO value, could help crossreferences, site search
> features... all that stuff that tags do.
>
> What do you think?  How can it be better?
>

Establishing some sort of categorization for content in each work we
want to publish is an imperative for the CI-driven site buildsystem I'm
working on.  I'd really like to apply the expertise in the group to
forging this into a tenable standard, and require it for future
contributions.

For our existing guides, there was some concern expressed about
maintaining the abstract and stub (ie subtitle) in both the Docbook
markup and in the metadata, so I've already built a utility to
programatically extract that information and update the metadata file. 
It occurs to me now that the author list could be retrieved as well...TODO.

I'll start the conversation by pointing out two attributes that are
missing from the example.  First, the license; we're CC-BY-SA by default
but should be able to accept and state any acceptable license.  The
second came to mind after Ryan's question in that other thread; labeling
a 'class' or 'type' of a document, ie 'book', 'article', 'tutorial', etc.

Most of these fields should have predefined acceptable values.  Approved
content licenses, acceptable types,  standardized taxonomy.  We should
discuss that too.

-- 
-- Pete

-- 
docs mailing list
docs@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe:
https://admin.fedoraproject.org/mailman/listinfo/docs