Wikipedia demo - refresh this page to get more random examples

Categories assigned to text using ngram model (with relative ranking score):

Category assigned by SVM classifier using a linear model:

mathematics

Summary:

[2] The Roman Catholic diocese of Calabozo, embracing the section of Guárico and portions of the sections of Apure, Zamora, Portuguesa, Cojedes and Guzman Blanco, was created 7 March 1863 by Pius IX as a suffragan of the Archdiocese of Caracas (Santiago de Venezuela), and its first bishop was consecrated 30 October 1881. [1] It was a diocese until 1995. The Archdiocese of Calabozo is a Roman Catholic archdiocese in Venezuela.

Similar documents

Original text:

The Archdiocese of Calabozo is a Roman Catholic archdiocese in Venezuela. [1] It was a diocese until 1995. [2] The Roman Catholic diocese of Calabozo, embracing the section of Guárico and portions of the sections of Apure, Zamora, Portuguesa, Cojedes and Guzman Blanco, was created 7 March 1863 by Pius IX as a suffragan of the Archdiocese of Caracas (Santiago de Venezuela), and its first bishop was consecrated 30 October 1881.

Entities resolved to DBPedia URIs:

Category Name DBPedia URI (if available)
countries Venezuela http://dbpedia.org/resource/Venezuela
cities Caracas http://dbpedia.org/resource/Caracas
cities Santiago http://dbpedia.org/resource/Santiago,_Isabela


Entities:

Roman Catholic

Location in text: sentence index: 0 starting index in sentence: 6 ending index in sentence +1: 8

Entity data: {:entity=>"Roman Catholic", :gender=>:male, :type=>:person}

Roman Catholic

Location in text: sentence index: 0 starting index in sentence: 20 ending index in sentence +1: 22

Entity data: {:entity=>"Roman Catholic", :gender=>:male, :type=>:person}

Venezuela

Location in text: sentence index: 0 starting index in sentence: 10 ending index in sentence +1: 11

Entity data: {:entity=>"Venezuela", :gender=>:none, :type=>:place, :place_type=>"country"}

Caracas

Location in text: sentence index: 0 starting index in sentence: 63 ending index in sentence +1: 64

Entity data: {:entity=>"Caracas", :gender=>:none, :type=>:place, :place_type=>"country_capital"}

Venezuela

Location in text: sentence index: 0 starting index in sentence: 66 ending index in sentence +1: 67

Entity data: {:entity=>"Venezuela", :gender=>:none, :type=>:place, :place_type=>"country"}


Part of speech tags with (very experimental) anaphora resolution annotations (bold font):

The/DT, Archdiocese/NN, of/IN, Calabozo/NN, is/VBZ, a/DT, Roman/NNP, Catholic/NNP, archdiocese/NN, in/IN, Venezuela/NNP, ./., 1/NN, It/PRP/{1}, was/VBD, a/DT, diocese/NN, until/IN, 1995/NN, 2/NN, The/DT, Roman/NNP, Catholic/NNP, diocese/NN, of/IN, Calabozo/NN, embracing/VBG, the/DT, section/NN, of/IN, Gu/NN, rico/NN, and/CC, portions/NNS, of/IN, the/DT, sections/NNS, of/IN, Apure/NN, Zamora/NNP, Portuguesa/NN, Cojedes/NN, and/CC, Guzman/NNP, Blanco/NNP, was/VBD, created/VBN, 7/NN, March/NNP, 1863/NN, by/IN, Pius/NNP, IX/NN, as/IN, a/DT, suffragan/NN, of/IN, the/DT, Archdiocese/NN, of/IN, Caracas/NNP, Santiago/NNP, de/FW, Venezuela/NNP, and/CC, its/PRP$, first/NN, bishop/NN, was/VBD, consecrated/NN, 30/NN, October/NNP, 1881/NN, ./.


Segmented sentences from raw text:

0 ["The", "Archdiocese", "of", "Calabozo", "is", "a", "Roman", "Catholic", "archdiocese", "in", "Venezuela", ".[1]", "It", "was", "a", "diocese", "until", "1995", ".[2]", "The", "Roman", "Catholic", "diocese", "of", "Calabozo", ",", "embracing", "the", "section", "of", "Guárico", "and", "portions", "of", "the", "sections", "of", "Apure", ",", "Zamora", ",", "Portuguesa", ",", "Cojedes", "and", "Guzman", "Blanco", ",", "was", "created", "7", "March", "1863", "by", "Pius", "IX", "as", "a", "suffragan", "of", "the", "Archdiocese", "of", "Caracas", "(Santiago", "de", "Venezuela", ")", ",", "and", "its", "first", "bishop", "was", "consecrated", "30", "October", "1881", "."]