Articles | Volume 5, issue 2
https://doi.org/10.5194/soil-5-177-2019
https://doi.org/10.5194/soil-5-177-2019
Original research article
 | 
17 Jul 2019
Original research article |  | 17 Jul 2019

Word embeddings for application in geosciences: development, evaluation, and examples of soil-related concepts

José Padarian and Ignacio Fuentes

Viewed

Total article views: 4,057 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
2,697 1,203 157 4,057 169 167
  • HTML: 2,697
  • PDF: 1,203
  • XML: 157
  • Total: 4,057
  • BibTeX: 169
  • EndNote: 167
Views and downloads (calculated since 29 Jan 2019)
Cumulative views and downloads (calculated since 29 Jan 2019)

Viewed (geographical distribution)

Total article views: 4,057 (including HTML, PDF, and XML) Thereof 3,434 with geography defined and 623 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 Jan 2026
Download
Short summary
A large amount of descriptive information is available in geosciences. Considering the advances in natural language it is possible to rescue this information and transform it into a numerical form (embeddings). We used 280764 full-text scientific articles to train a language model capable of generating such embeddings. Our domain-specific embeddings (GeoVec) outperformed general domain embedding tasks such as analogies, relatedness, and categorisation, and can be used in novel applications.
Share