Research in programming Wikidata/Academic institution

This research is devoted to the study of Academic institution presented at Wikidata. With the help of SPARQL queries to Wikidata, the following tasks are solved:

  • Building a list of all academic institutions;
  • Building a histogram of academic institutions, which are sorted by date of foundation;
  • Display on the map all academic institutions around the world.

In the course of the work, coordinates of the location of the Wikidata (Academic institution) objects have been filled in Wikidata. It was made conclusions about the completeness and incompleteness of information, presented in Wikipedia and the Wikidata — it was shown numder of academic institutions with filled property "coordinate location". Also there were identified years, in which the largest number of academic institutions were founded.

List of academic institutions edit

Given:

Let's build a list of all academic institutions.

#added 2017-02
#added 2017-02
#List of `instances of` "academic institution" 
SELECT ?academic_institution ?academic_institutionLabel
WHERE
{
    ?academic_institution wdt:P31 wd:Q4671277. #academic institution
    SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}

SPARQL query, 3892 records.

👍> The most complete and well-developed academic institutions on the Wikimedia are: Balashov branch of Saratov State University, Moscow State Institute of Culture, Samara State Regional Academy.

👎> Almost empty and uninformative academic institutions were: Novorossiysk Polytechnic Institute, Rostov College of Arts, Omsk State Institute of Service.

The date of the founding of academic institutions in countries around the world edit

Let's build a histogram of academic institutions sorted by the date of foundation and located in different countries.

Given:

#defaultView:BarChart
SELECT DISTINCT  (SAMPLE(?year) AS ?year) (COUNT(?year) AS ?count) (SAMPLE(?_academicLabel) AS ?_academicLabel) WHERE {
  ?object (wdt:P31) wd:Q4671277. #academic institution
  BIND(str(YEAR(?_inception)) AS ?year) 
  ?object wdt:P571 ?_inception. #date of foundation
  ?object wdt:P17 ?_country. #country
  ?_country rdfs:label ?_academicLabel.
  FILTER((LANG(?_academicLabel)) = "en") 
}
GROUP BY ?_inception ?_country 
ORDER BY ?year ?_inception

As of May 2017, SPARQL query has returned 381 records. As of december number of records have been increased to 1097 — their quantity has been increased 3 times. Data is shown on histogram for 18.05.2017.

 
Histogram which shows the date of foundation of academic institutions all over the world. 18.05.2017


According to the first histogram for May 2017, it can be determined that the largest number (15) of academic institutions was established in 2002. Let's build new histogram for 07.12.2017

 
Histogram which shows the date of foundation of academic institutions all over the world. 07.12.2017


According to the second histogram for December 2017, it is clear that the leaders in the number of founded academic institutions (35) are 2002 and 2007 years.

Academic institutions of Russia edit

Let's build a SPARQL query to find how many information about domestic academic institutions can be found in Wikidata.

Given:

#Number of academic institutions  in Russia, USSR and Russian Empire.
SELECT ?academic_institution ?academic_institutionLabel ?countryLabel WHERE {
  ?academic_institution wdt:P31 wd:Q4671277. #academic institution
  
  ?academic_institution wdt:P17 ?country.       #country
  FILTER(?country = wd:Q34266 || ?country = wd:Q15180 || ?country = wd:Q159). 
                                                #Russian Empire, USSR, Russia
  
  SERVICE wikibase:label { bd:serviceParam wikibase:language "ru". }
}

SPARQL query, 27 records.

Category "Academic institutions in Russia" of English Wikipedia contains information about several dozen academic institutions located in Russia. Russian Wikipedia does not have pages that are associated with the concept of an academic institution. Thus most of the information about academic institutions in Russia is presented in the English Wikipedia

Academic institutions on the world map edit

Let's display on the map academic institutions and higher educational institutions around the world.

Given:

#List of `coordinate location` "academic institution and higher educarional institution"
#defaultView:Map
SELECT ?academic_institutionLabel ?academic_institutionDescription ?coord
WHERE
{
  { ?academic_institution wdt:P31 wd:Q4671277; wdt:P625 ?coord } UNION  #coordinate location of academic institutions
  { ?academic_institution wdt:P31 wd:Q38723; wdt:P625 ?coord } #coordinate location of higher education institutions
                            
  
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}

SPARQL query, 1603 records.

Completeness of Wikidata: academic institutions around the world edit

Academic institution (according to the English Wikipedia) at the end of studying allows a person to obtain a certain academic degree or graduate qualification(like Bachelor, Master). Thus, all higher educational institutions can be academic institutions.

Statistics of universities:

As can be seen from last SPARQL-query, only 1604 higher educational institutions have filled property coordinate location (P625). This is due to the fact that small number of Wikimedia objects — about 8% — are filled correctly and fully.

A list of academic institutions with the empty property coordinate location edit

Let's list the academic institutions and higher educational institutions with the empty property coordinate location (P625) in order to show incompleteness of Wikidata.

Given:

#List of `coordinate location` "academic_institution without a coordinate location" 
SELECT ?academic_institution ?academic_institutionLabel ?academic_institutionDescription ?coord 
WHERE
{
    {?academic_institution wdt:P31 wd:Q4671277} UNION #academic_institution
    {?academic_institution wdt:P31 wd:Q38723} #higher education institution
    MINUS { ?academic_institution wdt:P625 [] } . #coordinate location of organization
    SERVICE wikibase:label { bd:serviceParam wikibase:language "en" }
}

SPARQL query, 3058 records.

Incompleteness of Wikidata: academic institutions around the world edit

As can be seen from SPARQL-query, only 1603 academic institutions and higher educational institutions have the filled coordinate location property. The data is visible on the version 1.0 image.

 
Academic institutions with filled property "coordinate location" v1.0.


Let's improve this situation and fill the property "coordinate location" of one hundred objects "academic institution", thereby increasing the number of records returned by the SPARQL-query from 1603 to 1703. The data is visible in the version 1.1 image.

 
Academic institutions with filled property "coordinate location" v1.1.


Future work edit

  1. Find out the most popular language among acedemic institution (property "language used"). For clarity, build a bubble diagram.
  2. Find academic institutions with a website (property "official website"). Mark them on the map.
  3. Rate of academic institutions by the number of awards according to the Wikidata (property "award received").

Tests edit


SPARQL-queries with answers:

Notes edit

References edit

  • "Ranking of World Universities". Retrieved 2017-05-13.
  • Krizhanovsky A., Horolskiy E. Foundation dates and geography of academic institutions // Authorea
  • Krizhanovsky A., Horolskiy E. Foundation dates and geography of academic institutions // Nauchkor