[Koha-bugs] [Bug 17785] oai.pl returns wrong URLs under Plack
bugzilla-daemon at bugs.koha-community.org
bugzilla-daemon at bugs.koha-community.org
Mon Jul 23 07:30:23 CEST 2018
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=17785
--- Comment #13 from Pablo AB <pablo.bianchi at gmail.com> ---
I found this issues:
a. "FAIL baseURL supplied [...]" from openarchives.org validation site
b. "FAIL Bad earliestDatestamp [...]"
c. validator.oaipmh.com warning about Content-Type
d. Koha XML output doesn't validate against OAI-PMH XML schema.
(In reply to Tomás Cohen Arazi from comment #11)
> (a) Our fix for the XSLT masked the underlying bug (our code doesn't know
> about the proxypass we do, and then Plack answers what he knows,
> /opac/oai.pl).
So this bug should be reopened? bibkat.no doesn't fail beacause it doesn't use
Plack?
To test I found this command useful:
curl -s "https://sites/cgi-bin/koha/oai.pl?verb=Identify" | xmllint --format -
| grep baseURL
> (b) earliestDatestamp is read directly from the DB and not formatted as
> rfc3339 as it should).
(b) responseDate is well formed, earliestDatestamp is not.
>From XSD: "Tightened definition of UTCdatetimeType to enforce the restriction
to UTC Z notation."
responseDate and expirationDate should be YYYY-MM-DDThh:mm:ssZ
https://www.openarchives.org/OAI/openarchivesprotocol.html#DatestampsResponses
So Koha should convert to this format what it gets from the DB.
(In reply to Tomás Cohen Arazi from comment #12)
> According to my interpretation of the spec, the test is not correct:
> https://www.openarchives.org/OAI/openarchivesprotocol.html#HTTPResponseFormat
(c) You are right, seems a problem of that validator. I'll contact site creator
(Vangelis Banos).
Also acording to RFC7303 (9.2) which obsoletes RFC3023 (3).
On websites returning text/html validator.oaipmh.com oddly say they return a
correct text/xml.
This problem is very common. A FOAF tested al the URLs from
http://opendoar.org/OAIbaseURLs.php and get
575 text/xml
468 text/html
307 application/xml
(d) Koha XML sometimes invalid, tested with:
wget -q "https://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"
wget -q "https://demo.bibkat.no/cgi-bin/koha/oai.pl?verb=ListSets" -O test.xml
xmllint --noout --schema OAI-PMH.xsd test.xml
- When no set is defined Koha incorrectly output <ListSets/>. A set (child
element of ListSets) should always be there, minOccurs default value is 1.
Otherwhise should be:
<error code="noSetHierarchy">No sets defined</error>, AFAIK even if tehere is
no error.
- setDescription is not a string, is descriptionType ##other, "defined by a
community" (?)
https://www.w3.org/TR/xmlschema-0/#ref34
https://www.w3.org/2005/07/xml-schema-patterns.html
https://www.openarchives.org/OAI/2.0/guidelines-repository.htm#setDescription
https://www.openarchives.org/OAI/openarchivesprotocol.html#Set and 4.6
ListSets > Examples
--
You are receiving this mail because:
You are watching all bug changes.
More information about the Koha-bugs
mailing list