<div dir="ltr">Hi Markus,<div><br></div><div>As you can imagine we at ADS have had to struggle with similar issues. As you mention, the goals of faithful representation of a record (in particular a person's name) and the one of discoverability run against each other.<br><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Aug 4, 2016 at 5:20 AM, Markus Demleitner <span dir="ltr"><<a href="mailto:msdemlei@ari.uni-heidelberg.de" target="_blank">msdemlei@ari.uni-heidelberg.de</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><br>
Several VOResource elements contain names. Again, for reliable global<br>
discoverability, such names must be given in (common) English<br>
transliteration where their original form uses non-Latin scripts.<br>
Latin letters with diacritics should also be transliterated.<br></blockquote><div><br></div><div>The transliteration of Latin letters with diacritics seems a bit harsh to me. If it's there to make sure that searches containing non-diacritic terms match the original strings with diacritics, there are other ways to do this (downgrade everything to ascii when indexing and searching). Since this is a problem that only affects searchable registries, I would investigate if the technologies currently used to host these databases allow for that, in which case there should be no extra work involved in keeping the diacritics in.</div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">
I feel a bit bad about codifying this amount of cultural bias, but<br>
I'm convinced that for reliable discovery, we'll have to say<br>
something pretty close to that.<br></blockquote><div><br></div><div>Agree that in principle it seems unfair, but in practice nobody has complained so far when we suggest to our chinese and japanese colleagues to give us abstracts in english...</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">
In particular on the question of names, I'm really uncertain, though.<br>
It seems patently wrong to me to have no place for names in, say,<br>
Cyrillic or Chinese or Japanese scripts. At least for elements with<br>
an explicit name element (creator, contributor, contact), it would<br>
not be hard to add an additional element (perhaps originalName?) that<br>
could legally contain non-latin letters. I'd be happy to introduce<br>
them if people asked for them and would volunteer to put out records<br>
using them.<br></blockquote><div><br></div><div>We have a field in our bibliographic data that can be used to retain the author name in its native script. Although at the moment we don't do anything useful with it the plan is to expose it and use it for indexing to help with disambiguation. I don't think you should worry about disambiguation now but it seems like a good idea to capture the faithful representation of somebody's name, so I'd vote for that. </div><div><br></div><div>BTW I noticed that Datacite says nothing of the sort and one of their examples has a name in chinese script in the <creator> field: <a href="https://schema.datacite.org/meta/kernel-3.1/example/datacite-example-complicated-v3.0.xml">https://schema.datacite.org/meta/kernel-3.1/example/datacite-example-complicated-v3.0.xml</a></div><div>The schema however allows for a title and a translated title (which is in english).</div><div><br></div><div>-- Alberto</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">
<br>
Any takers?<br>
<span class="gmail-HOEnZb"><font color="#888888"><br>
-- Markus<br>
</font></span></blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature"><div dir="ltr"><div>Dr. Alberto Accomazzi<br>Principal Investigator</div><div>NASA Astrophysics Data System - <a href="http://ads.harvard.edu" target="_blank">http://ads.harvard.edu</a><br>Harvard-Smithsonian Center for Astrophysics - <a href="http://www.cfa.harvard.edu" target="_blank">http://www.cfa.harvard.edu</a><br>60 Garden St, MS 83, Cambridge, MA 02138, USA</div></div></div>
</div></div></div>