Articles | Volume 5, issue 2
https://doi.org/10.5194/soil-5-177-2019
https://doi.org/10.5194/soil-5-177-2019
Original research article
 | 
17 Jul 2019
Original research article |  | 17 Jul 2019

Word embeddings for application in geosciences: development, evaluation, and examples of soil-related concepts

José Padarian and Ignacio Fuentes

Viewed

Total article views: 3,879 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
2,587 1,145 147 3,879 157 150
  • HTML: 2,587
  • PDF: 1,145
  • XML: 147
  • Total: 3,879
  • BibTeX: 157
  • EndNote: 150
Views and downloads (calculated since 29 Jan 2019)
Cumulative views and downloads (calculated since 29 Jan 2019)

Viewed (geographical distribution)

Total article views: 3,879 (including HTML, PDF, and XML) Thereof 3,263 with geography defined and 616 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 Nov 2025
Download
Short summary
A large amount of descriptive information is available in geosciences. Considering the advances in natural language it is possible to rescue this information and transform it into a numerical form (embeddings). We used 280764 full-text scientific articles to train a language model capable of generating such embeddings. Our domain-specific embeddings (GeoVec) outperformed general domain embedding tasks such as analogies, relatedness, and categorisation, and can be used in novel applications.
Share