Authors: Bernardo Magnini, Manuela Speranza, Dimitris Mavroeidis, Christian Spurk, Olivier Hamon, Christian Girardi
Date: October 8, 2012
This document is a guide for the users of META-SHARE. As such, it describes the functionalities implemented in V3.0 of META-SHARE and provides the users with step-by-step explanations of how to exploit them.
META-SHARE (http://www.meta-share.eu/ and http://www.meta-share.org/) is a network of repositories of language resources (LRs), including both language data and language tools. The resources are described through a set of metadata (see T4ME Deliverable 7.2.4 for a detailed description of the metadata schema) and aggregated in central inventories that allow uniform search and access. LRs can be open or subject to restricted access rights, and can be distributed either free of charge or for a fee.
More specifically, META-SHARE V3.0 offers the user the possibility to:
When looking for a LR, the user can perform both keyword-based search and browsing of the catalogue.
Both a simple search and a faceted search mechanism are available to search through META-SHARE.
The user can have access to the entire META-SHARE catalogue through a simple keyword-based search:
The results page (see Figure 2) lists all the LRs matching the query (if the user doesn’t type any word, the result consists of the entire catalogue). For each LR, the following metadata information is provided: resource name, resource short name (if available), resource type, media type (see Figure 3) and language (if available). The number of downloads and the number of views are also given.
On the left pane of the results page, there is a list of facets (or filters). The user can filter the search results by any of the following fields:
- Language
- Resource Type
- Media Type
- Availability
- Licence
- Restrictions of Use
- Validated
- Foreseen Use
- Use is NLP Specific
- Resource Creator
- Linguality Type
- Multilinguality Type
- Modality Type
- MIME Type
- Conformance to Standards / Best Practices
- Domain
- Geographic Coverage
- Time Coverage
- Subject
- Language Variety
Filters can be combined with search terms entered in the search box. The number of LRs that would be available if a specific filter were selected is reported alongside each filter. When the user selects a filter, its typeface changes to bold. Multiple filters can be applied. For instance, if a user requires a parallel corpus of English and French, he/she can select both “French” and “English” in the “Language” field of the filtering pane as well as “corpus” in the “Resource Type” field. Filters can be removed by clicking again on the selected field.
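The way filters and search terms combine can be sketched in a few lines of Python. This is only an illustration of the behaviour described above, not the code used by META-SHARE: the catalogue records, the field names (`name`, `resource_type`, `languages`) and the functions `search` and `facet_counts` are all hypothetical, and the sketch assumes that selecting several languages means that a resource must cover all of them.

```python
from collections import Counter

# Hypothetical in-memory catalogue; real META-SHARE records follow the
# META-SHARE metadata schema and are much richer than this.
CATALOGUE = [
    {"name": "EN-FR Parallel News", "resource_type": "corpus",
     "languages": {"English", "French"}},
    {"name": "French Lexicon", "resource_type": "lexicon",
     "languages": {"French"}},
    {"name": "English Speech Corpus", "resource_type": "corpus",
     "languages": {"English"}},
]


def search(catalogue, keywords="", languages=(), resource_type=None):
    """Return the records matching the keywords and every selected filter."""
    words = keywords.lower().split()
    results = []
    for record in catalogue:
        name = record["name"].lower()
        # An empty query has no words, so it matches the entire catalogue.
        if not all(word in name for word in words):
            continue
        # Assumption: every selected language must be covered by the resource.
        if not set(languages) <= record["languages"]:
            continue
        if resource_type and record["resource_type"] != resource_type:
            continue
        results.append(record)
    return results


def facet_counts(results, field):
    """Count how many of the given results fall under each value of a facet."""
    counts = Counter()
    for record in results:
        value = record[field]
        counts.update(value if isinstance(value, set) else {value})
    return counts


# Parallel English-French corpus: both languages plus resource type "corpus".
parallel = search(CATALOGUE, languages=("English", "French"),
                  resource_type="corpus")
print([record["name"] for record in parallel])

# Counts displayed next to each value of the "Language" facet.
print(facet_counts(search(CATALOGUE), "languages"))
```

In this sketch each additional filter narrows the result set, which mirrors how selecting “French”, “English” and “corpus” together leaves only parallel corpora covering both languages.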
The user can browse the catalogue as follows:
The LRs can be ordered using the select box at the top right of the results page, by selecting one of the following items:
The user can click on the name of a LR from the results page obtained by any type of search (see previous Section) to open the page with the details for that LR (see Figure 4). Information about a LR includes all the metadata information available for that resource (e.g. a textual description, the licensing conditions under which it is distributed, etc.) organized using 4 panes, which are described in detail below. Numbers and dates are presented according to administrative settings. For instance, if the “LANGUAGE_CODE” is set to “en-gb”, dates will be of the form “DD/MM/YYYY”, and numbers of the form “XX,XXX.XX”.
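As an illustration of the two patterns mentioned above for “en-gb”, the following Python sketch formats a date as “DD/MM/YYYY” and a number as “XX,XXX.XX”. It only mirrors the example in the text: the table `DATE_PATTERNS` and the functions `format_date` and `format_number` are hypothetical names, and the portal itself derives its formats from the administrative settings rather than from code like this.

```python
from datetime import date

# Illustrative only: the date pattern for the "en-gb" example given above.
DATE_PATTERNS = {"en-gb": "%d/%m/%Y"}


def format_date(value, language_code="en-gb"):
    """Render a date using the pattern registered for the language code."""
    return value.strftime(DATE_PATTERNS[language_code])


def format_number(value):
    """Comma as thousands separator and two decimal places (the en-gb form)."""
    return f"{value:,.2f}"


print(format_date(date(2012, 10, 8)))  # 08/10/2012
print(format_number(12345.678))        # 12,345.68
```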
The top pane provides essential information about the resource, such as the resource name, short name and description. If the resource provider has supplied this information in other languages, these can be shown at the click of a button. If a URL for the resource is provided, it is also shown here. In addition, the top right corner of the top pane provides statistical information about the number of times the resource was viewed, updated and downloaded.
The bottom left pane provides legal and contact information. The former includes license names and attributes, covering the wealth of information provided by the metadata schema. The latter includes the person or organization to be contacted for details about the resource. Only the names of contact persons and organizations are shown at first. When a name is clicked, a small frame opens to reveal full details. All email addresses of persons and organizations are protected from bots and crawlers. URLs are truncated to avoid cluttering the resource view. See Figure 5 for more details.
The middle pane provides media information. Available media types are: text, audio, video, image, n-gram text and numerical text. Each of these types is presented in an individual tab. When multiple instances of these types exist, they are presented in sub-tabs, as shown in Figure 6.
The bottom right pane provides metadata creation information. Information about how, when and why the resource was created is presented. Derived publications, manuals, associated resources and validation information are also part of the right pane.
At the bottom of the page, there are recommendations about LRs that could be of interest to the user (see Figure 7). Recommended LRs are extracted from usage statistics (see Statistics).
If a LR is directly provided through META-SHARE, the user can download it from the page with the details for that LR (Figure 4). Steps to follow are:
Users can register with META-SHARE and log in in order to have access to further functionalities of the portal, such as downloading a LR. Being a registered user is also a prerequisite for becoming a LR provider (see the META-SHARE User Manual for more information).
3.4.1 Register as a new user
In order to register with META-SHARE and get an account:
Registered users can use their credentials to log in to META-SHARE:
At the end of the working session, the user can log out by clicking on the “Logout” button at the top right of the home page.
If the user forgets his/her password, the system offers the possibility to retrieve it:
The user can edit his/her Profile:
In order to change the First Name, Last Name, or Email, the user should contact the META-SHARE Helpdesk at helpdesk-technical@meta-share.eu
The user can apply for membership of an editor group:
The application is moderated. When it is accepted by an editor group manager, the user receives a notification email (see Applying for Editor Group Memberships).
The user can apply for membership of an organization:
The application is moderated. When it is accepted by an organization manager, the user receives a notification email (see Applying for Organization Memberships).
Users can access a discussion forum where the META-SHARE community gives help regarding Legal, Technical and Metadata aspects. Click the “Community” button in the top menu to access the forum.
The “Statistics” button in the header gives access to various types of statistical information about the use of a META-SHARE node.
The first tab is the “META-SHARE node visits statistics”. By default, the system shows “the most viewed resources”; the user can select the other lists from the select box at the top of the statistics page (see Figure 9).
Five different lists are available:
- the most viewed resources;
- the top queries;
- the latest queries;
- the top downloaded resources;
- the latest updated resources.
The user can filter the statistics results by choosing one of the lists above and combining it with other filters: a date filter and/or a country-of-provenance filter.
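The combination of a list with the date and country filters can be pictured with the following Python sketch for the “most viewed resources” list. It is an assumption-laden illustration, not the portal's implementation: the view log `VIEWS`, its fields, the function `most_viewed` and the reading of the date filter as a “since” date are all hypothetical.

```python
from collections import Counter
from datetime import date

# Hypothetical view log: one entry per visit to a resource page.
VIEWS = [
    {"resource": "Corpus A", "day": date(2012, 9, 1), "country": "IT"},
    {"resource": "Corpus A", "day": date(2012, 9, 15), "country": "FR"},
    {"resource": "Lexicon B", "day": date(2012, 9, 20), "country": "IT"},
    {"resource": "Corpus A", "day": date(2012, 10, 2), "country": "IT"},
]


def most_viewed(events, since=None, country=None):
    """Rank resources by number of views, after applying optional filters."""
    counts = Counter(
        event["resource"]
        for event in events
        # Assumption: the date filter keeps events on or after `since`.
        if (since is None or event["day"] >= since)
        and (country is None or event["country"] == country)
    )
    return counts.most_common()


print(most_viewed(VIEWS))                # no filter
print(most_viewed(VIEWS, country="IT"))  # country filter only
print(most_viewed(VIEWS, since=date(2012, 9, 10), country="IT"))  # both
```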
The results tab shows the matching resources (see Figure 9). The user can visit each resource page by clicking on the resource name. In each resource row in the results page, the following information is displayed:
The links “Previous” and “Next” at the top of the page are provided for easy navigation through the pages of results.
This panel shows which metadata have been used to describe the META-SHARE linguistic resources.
To access this page the user should do the following (see Figure 10):
Two different filters appear at the top of the statistics page:
The user can select from the two filters to obtain more specific statistics. When one of the filters is activated, the metadata are shown and grouped as defined in the META-SHARE model.
Under these filters, the user can find the metadata elements in rows. If a metadata element is used, two counters are displayed. The first counter shows how many times the element has been filled in with a value, while the second counter shows the number of resources for which the element has been used.
For instance, the counters 129/25 of the “Annotation type” element (see Figure 10) mean that this required metadata element has been filled in 129 times (possibly with redundant values), and that 25 different resources have been described with it.
Each metadata element in use can also be clicked to show a table with all the values that have been filled in (see for example the values of “Annotation type” in Figure 10).
The last statistics tab, “My resources” (see Figure 11), is activated whenever a user is logged in. This tab is used to check the status of the user’s resources.
For each resource the following information is available:
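The two counters and the table of values can be illustrated with a short Python sketch. The data and the functions `usage_counters` and `value_table` are hypothetical and serve only to show how a pair such as 129/25 should be read; they are not taken from META-SHARE.

```python
from collections import Counter

# Hypothetical data: (resource id, value filled in for "Annotation type").
# A resource may fill in the same element several times.
annotation_types = [
    ("res-1", "lemmatization"),
    ("res-1", "part-of-speech tagging"),
    ("res-2", "lemmatization"),
    ("res-3", "lemmatization"),
]


def usage_counters(entries):
    """Return (times the element was filled in, distinct resources using it)."""
    filled = len(entries)
    resources = len({resource_id for resource_id, _ in entries})
    return filled, resources


def value_table(entries):
    """Return each filled-in value with its frequency, most frequent first."""
    return Counter(value for _, value in entries).most_common()


filled, resources = usage_counters(annotation_types)
print(f"{filled}/{resources}")        # 4/3
print(value_table(annotation_types))
```

Here the element has been filled in 4 times across 3 resources, which is the same reading as 129 values across 25 resources in the example above.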