Overview
Overview
This dataset comprises comprehensive records that include detailed information about various cities, countries, significant dates, organizations, and individuals. Each entry is meticulously verified against Wikidata to ensure accuracy and reliability. The dataset is enriched with additional attributes, offering valuable insights into historical events, cultural significance, geographical coordinates, environmental data, and much more. This makes the dataset highly suitable for geographical analysis, historical research, data-driven decision-making, and academic purposes. By providing a structured and enriched set of data, it aims to be a valuable resource for analysts, researchers, and professionals across various fields.
File Format
The dataset is stored in a spreadsheet CSV format for easy access and analysis.
Data Dictionary
Field Name | Data Type | Description | Example |
---|---|---|---|
City | String | The name of the city associated with the record. | Andorra la Vella, Santiago, Beijing, Havana |
Country | String | The country in which the city is located. | Andorra, Chile, China, Cuba |
Title | String | A title or descriptor for the record, which might refer to an event, landmark, or notable subject related to the city. | Andorra la Vella, Tajo Abierto, Embassy of Zimbabwe, London, Renminbi |
Link | URL String | A hyperlink (typically a Wikipedia URL) that points to a page with detailed information about the title or subject. | https://en.wikipedia.org/wiki/Andorra_la_Vella |
Extracted Years | String/List | One or more years (often comma‑separated) extracted from the text that highlight significant dates or events. | "2015" or "2000, 2012" |
People | String | Names and enriched details of individuals mentioned in the record. This may include additional metadata such as summaries, dates, or other attributes extracted from the text. | full_name: Francisca Valenzuela | summary: Francisca Valenzuela (born March 17, 1987, …) |
Organization | String | Names and enriched details of organizations linked to the record. These might include companies, institutions, or other entities along with additional metadata. | North-South Carrier or Groupe Latécoère | Description: Aeronautics company | Founded: 1957-01-01T00:00:00Z |
Locations | String | Additional location names referenced in the record that are not the primary city. | London, Paris (if mentioned) |
Coordinates | String | Geographic coordinates provided in a textual format (commonly a WKT “Point(longitude latitude)” string) indicating the physical location of the record. | "Point(1.49414 42.5045)" or "Point(-7.216314 39.604008)" |
Historical Events | String | A brief reference to any historical events associated with the record. | Magellan-Elcano expedition, Siege of Bogotá, shipwrecking |
Cultural Significance | String | Descriptive information about the cultural, social, or historical importance of the city or subject. | Andorra la Vella is the capital and largest city of Andorra. It is located high in the east Pyrenees, between France and Spain… |
Wikipedia Summary | String | A truncated summary of the Wikipedia article that offers an overview of the subject. | Andorra la Vella is the capital and largest city of Andorra. It is located high in the east Pyrenees, between France and Spain… |
Wikipedia Link | URL String | A direct URL to the full Wikipedia article that provides additional detailed information about the record. | https://en.wikipedia.org/wiki/Andorra_la_Vella |
Content | String | The main body of text for the record. This field contains detailed narrative information such as history, demographics, and notable facts about the city or subject. | Andorra la Vella is the capital and largest city of Andorra. It is located high in the east Pyrenees, between France and Spain. As of 2015, the city had a population of 22,886… |
Usage
-
This enriched dataset provides comprehensive information about locations, people, and organizations, making it valuable for geographical analysis, historical research, data-driven decision-making, academic purposes, and more.
-
Use the provided coordinates for mapping and spatial analysis.
-
Explore the additional attributes for deeper insights into the historical, cultural, and environmental context of each record.