Analysis of Google News coverage: A comparative study of Brazil, Colombia, Mexico, Portugal, and Spain
DOI:
https://doi.org/10.26439/contratexto2024.n42.7212Keywords:
Google News, digital media, news, news aggregators, data analysisAbstract
This study aims to examine the news coverage provided by Google News across five Ibero-American countries, including three from Latin America and two from Europe: Brazil, Colombia, Mexico, Portugal, and Spain. The main focus is to highlight the differences and similarities in news presentation within diverse contexts, evaluating the presence and distribution of news using quantitative indicators, and analyzing the predominant content in each country’s news, based on a dataset collected between January 2 and January 31, 2024. This includes examining news sources, geographic coverage, prominent figures, and the prevalence of sensational elements through the identification of clickbaits. Our research employs statistical analyses and algorithmic solutions in natural language processing and artificial intelligence to generate results. The analyses revealed consistency in the daily delivery of content by Google News, with specific variations in the update rate across the studied countries. A diversity of news sources was observed, with a greater tendency toward local news and frequent mentions of politicians, celebrities, and businesspeople. In addition, the analyses uncovered a significant presence of clickbait, with variations across the countries and topics.
Downloads
References
Alzahrani, S. M. (2013). Building, profiling, analysing and publishing an Arabic news corpus based on Google News RSS feeds. In R. E. Banchs, F. Silvestri, T. Y. Liu, M. Zhang, S. Gao, & J. Lang -Eds.), Lecture notes in computer science: Vol. 8281. Information retrieval technology (pp. 488-499). Springer. https://doi.org/10.1007/978-3-642-45068-6_42
Basch, C. H., Hillyer, G. C., & Jacques, E. T. (2022). News coverage of colorectal cancer on Google News: Descriptive study. JMIR Cancer, 8(2), Article e39180. https://doi.org/10.2196/39180
Calzada, J., & Gil, R. (2020). What do news aggregators do? Evidence from Google News in Spain and Germany. Marketing Science, 39(1), 134-167. https://doi.org/10.1287/mksc.2019.1150
Christodoulou, C. (2024). XLM-RoBERTa-Multilingual-Clickbait-Detection. Hugging Face. https://huggingface.co/christinacdl/XLM_RoBERTa-Multilingual-Clickbait-Detection
Cobos, T. L. (2018). Perceptions and experiences about Google News from the editors of Latin-American news media indexed in the editions of Colombia and Mexico. Estudios sobre el Mensaje Periodístico, 24 (2), 1183-1198. https://hdl.handle.net/20.500.12585/9225
Cobos, T. L. (2020). Journalism industries in the internet era: The case of Colombian news media outlets in Google News Colombia. Contratexto, (33), 85-104. https://doi.org/10.26439/contratexto2020.n033.4785
Cobos, T. L. (2021). Origin and weight of news media outlets indexed on Google News: An exploration of the editions from Brazil, Colombia, and Mexico. Brazilian Journalism Research, 17(1), 28-63. https://doi.org/10.25200/BJR.v17n1.2021.1331
Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., Grave, E., Ott, M., Zettlemoyer, L., & Stoyanov, V. (2020). Unsupervised cross-lingual representation learning at scale. Proceedings of the 58th annual meeting of the Association for Computational Linguistics (pp. 8440-8451). https://doi.org/10.18653/v1/2020.acl-main.747
Cordeiro, D. F. (2024). Perspectivas en contraste: análisis comparativo cuantitativo España y Brasil de la cobertura del conflicto israelí-palestino en Google News: análise comparativa quantitativa Espanha e Brasil da cobertura do conflito israelo-palestino no Google News. Documentación de las Ciencias de la información, 47, 15-25. https://doi.org/10.5209/dcin.92187
Cozza, V., Hoang, V.T., Petrocchi, M., Spognardi, A. (2016). Experimental measures of news personalization in Google News. In S. Casteleyn, P. Dolog, & C. Pautasso. (Eds.), Current trends in web engineering (pp. 93–104). Springer. https://doi.org/10.1007/978-3-319-46963-8_8
Das, A. S., Datar, M., Garg, A., & Rajaram, S. (2007). Google news personalization: scalable online collaborative filtering. WWW ‘07: Proceedings of the 16th international conference on World Wide Web, New York, NY, USA, 271-280. https://doi.org/10.1145/1242572.1242610
Devlin, J., Chang, M., Lee, K, & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In J. Burstein, C. Doran, & T. Solorio. (Eds.), Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (pp. 4171-4186). Association for Computational Linguistics. https://doi.org/10.48550/arXiv.1810.04805
Du, K., & Song, J. (2022). The impact of geotargeting on household information acquisition: Evidence from a Google News redesign. Research Policy, 51(10), Article 104596. https://doi.org/10.1016/j.respol.2022.104596
Evans, R., Jackson, D., & Murphy, J. (2022). Google news and machine gatekeepers: Algorithmic personalisation and news diversity in online news search. Digital Journalism, 11(9), 1682-1700. https://doi.org/10.1080/21670811.2022.2055596
Fischer, S., Jaidka, K., & Lelkes, Y. (2020). Auditing local news presence on Google News. Nature Human Behaviour, 4(12), 1236-1244.
Fu, J., Liang, L., Zhou, X., & Zheng, J. (2017). A convolutional neural network for clickbait detection. In S. Li, Y. Dai, & Y. Cheng. (Eds.), 2017 4th International Conference on Information Science and Control Engineering (ICISCE) (pp. 6-10). CPS. https://doi.org/10.1109/ICISCE.2017.11
Giomelakis, D. (2023). Semantic search engine optimization in the news media industry: Challenges and impact on media outlets and journalism practice in Greece. Social Media + Society, 9(3). https://doi.org/10.1177/20563051231195545
Google. (2024). Get started with Google News. https://support.google.com/googlenews/answer/9005669?hl=en&co=GENIE.Platform%3DAndroid
Guallar, J. (2015). Prensa digital en 2013-2014. Anuario ThinkEPI, 9, 153–160. https://doi.org/10.3145/thinkepi.2015.37
Guallar, J., Abadal, E., & Codina, L. (2013). Sistemas de acceso a la información de prensa digital: tipología y evolución. Investigación Bibliotecológica: Archivonomía, Bibliotecología e Información, 27(61), 29-52. https://doi.org/10.1016/S0187-358X(13)72553-X
Haim, M., Graefe, A., & Brosius, H. B. (2018). Burst of the filter bubble? Effects of personalization on the diversity of Google News. Digital Journalism, 6(3), 330-343. https://doi.org/10.1080/21670811.2017.1338145
Hong, C., Chen, C., & Chiu, C. (2006). New word extraction utilizing Google News corpuses for supporting lexicon-based Chinese word segmentation systems. The 2006 IEEE International Joint Conference on Neural Networks Proceedings, Vancouver, BC, 3040-3046. https://doi.org/10.1109/IJCNN.2006.247263
Hong, C., Chen, C., & Chiu, C. (2009). Automatic extraction of new words based on Google News corpora for supporting lexicon-based Chinese word segmentation systems. Expert Systems with Applications: An International Journal, 36(2), 3641-3651. https://doi.org/10.1016/j.eswa.2008.02.013
Joshi, D., & Gatica-Perez, D. (2006). Discovering groups of people in google news. Proceedings of the 1st ACM International Workshop on Human-Centered Multimedia (HCM ‘06), New York, NY, USA, 55-64. https://doi.org/10.1145/1178745.1178757
Le, H., Maragh, R., Ekdale, B., High, A., Havens, T., & Shafiq, Z. (2019). Measuring political personalization of Google News search. In L. Liu, & R. White (Eds.). WWW ’19: The Web Conference 2019 (pp. 2957-2963). Association for Computing Machinery. https://doi.org/10.1145/3308558.3313682
Li, J., Sun, A., Han, J., & Li, C. (2022). A survey on deep learning for named entity recognition: extended abstract. IEEE Transactions on Knowledge and Data Engineering, 34(1), 50-70. https://doi.org/10.1109/TKDE.2020.2981314
Lopezosa, C., Codina L., & Rovira, C. (2019). Visibilidad web de portales de televisión y radio en España: ¿qué medios llevan a cabo un mejor posicionamiento en buscadores? Universitat Pompeu Fabra, Barcelona. https://repositori.upf.edu/handle/10230/36234
Lopezosa, C., Giomelakis, D., Pedrosa, L., & Codina, L. (2024). Google Discover: uses, applications and challenges in the digital journalism of Spain, Brazil and Greece. Online Information Review, 48(1), 123-143. https://doi.org/10.1108/OIR-10-2022-0574
Lopezosa, C., Vállez, M., & Guallar, J. (2024). The vision of Google News from the academy: scoping review. Doxa Comunicación, 38, 317-332. https://doi.org/10.31921/doxacom.n38a1891
Mitchell, R. (2024). Web scraping with Python: Data extraction from the modern web. O’Reilly Media.
Montejo-Ráez, A., Perea-Ortega, J. M., Díaz-Galiano, M. C., & Ureña-López, L. A. (2009). SINAI at INFILE 2009: Experiments with Google News. CEUR Workshop Proceedings: Vol. 1175. https://ceur-ws.org/Vol-1175/CLEF2009wn-INFILE-MontejoRaezEt2009.pdf
Montejo-Ráez, A., Perea-Ortega, J. M., Díaz-Galiano, M. C., & Ureña-López, L. A. (2010). Experiments with Google News for filtering newswire articles. In C. Peters, G. M. Nunzio, M. Kurimo, T. Mandl, D. Mostefa, A. Peñas, & G. Roda (Eds.), Lecture Notes in Computer Science: Vol. 6241. Multilingual Information Access Evaluation I. Text Retrieval Experiments (pp. 381-384). Springer. https://doi.org/10.1007/978-3-642-15754-7_46
Müller, M. S., Cabecinhas, R., & Santos Silva, D. (2023). Cultural journalism in Brazil and Portugal: cross-country analysis. Brazilian Journalism Research, 19(1), Article e1546. https://doi.org/10.25200/BJR.v19n1.2023.1546
Negredo, S. (2023). Uno de cada cuatro internautas españoles dice usar Google News, y uno de cada cinco, Discover. Digital News Report España. https://www.unav.edu/web/digital-news-report/entradas/-/blogs/uno-de-cada-cuatro-internautas-espanoles-dice-usar-google-news-y-uno-de-cada-cinco-discover
Newman, N., Fletcher, R., Eddy, K., Robertson, C. T., & Nielsen R. K. (2023). Reuters Institute digital news report 2023. Reuters Institute for the Study of Journalism. https://reutersinstitute.politics.ox.ac.uk/sites/default/files/2023-06/Digital_News_Report_2023.pdf
Park, C. S. (2022). Reading a snippet on a news aggregator vs. clicking through the full story: Roles of perceived news importance, news efficacy, and news-finds-me perception. Journalism Studies, 23(11), 1350-1376. https://doi.org/10.1080/1461670X.2022.2086160
Patel, N. (2019). Cómo publicar tu sitio en Google News y generar más tráfico en tiempo real. Neilpatel.com. https://neilpatel.com/es/blog/como-publicar-tu-sitio-en-google-news-y-generar-mas-trafico-en-tiempo-real/
Pedrosa, L., & de Morais, O. J. (2021). Visibilidade web nos buscadores: Fatores algorítmicos de SEO on-page (FAOPs) como técnica e prática jornalística. Estudios sobre el Mensaje Periodístico, 27(2), 579-591. https://doi.org/10.5209/esmp.71291
Schroeder, R., & Kralemann, M. (2005). Journalism ex Machina-Google News Germany and its news selection processes. Journalism Studies, 6(2), 245-247. https://doi.org/10.1080/14616700500057486
Seror, J., Amar, A., Braz, L., & Rouzier, R. (2010). The Google News effect: Did the tainted milk scandal in China temporarily impact newborn feeding patterns in a maternity hospital? Acta Obstetricia et Gynecologica Scandinavica, 89(6), 823-827. https://doi.org/10.3109/00016349.2010.484046
Veremyev, A., Semenov, A., Pasiliao, E. L., & Boginski, V. (2019). Graph-based exploration and clustering analysis of semantic spaces. Applied Network Science, (4), Article 104. https://doi.org/10.1007/s41109-019-0228-y
Vermeer, S., Trilling, D., Kruikemeier, S., & de Vreese, C. (2020) Online news user journeys: The role of social media, news websites, and topics. Digital Journalism, 8(9), 1114-1141. https://doi.org/10.1080/21670811.2020.1767509
Watanabe, K. (2013). The western perspective in Yahoo! News and Google News: Quantitative analysis of geographic coverage of online news. International Communication Gazette, 75(2), 141-156. https://doi.org/10.1177/1748048512465546
Wilson, T. D., & Maceviciute, E. (2013). What’s newsworthy about ‘information seeking’? An analysis of Google’s News Alerts. Information Research, 18(1), Article 557. https://informationr.net/ir/18-1/paper557.html
Wilson, L. (2021). How to get your website listed in Google News. Search Engine Journal. https://www.searchenginejournal.com/how-to-get-listed-in-google-news/379701/
Wubben, S., van den Bosch, A., & Krahmer, E. (2011). Paraphrasing headlines by machine translation. Sentential paraphrase acquisition and generation using Google News. In T. Markus, P. Monachesi, & E. Westerhout (Eds.), Computational Linguistics in the Netherlands 2010: Selected Papers from the Twentieth CLIN Meeting (pp. 169-183). LOT. https://ilk.uvt.nl/~swubben/publications/clin_paraphrasing.pdf
Young, A., & Atkin, D. (2022). An agenda-setting test of Google News world reporting on foreign nations. Electronic News, 17(2), 113-132. https://doi.org/10.1177/19312431221106375
Young Lin, L. L., & Rosenkrantz, A. B. (2017). The U.S. online news coverage of mammography based on a Google News search. Academic Radiology, 24(12), 1612-1615. https://doi.org/10.1016/j.acra.2017.05.011
Published
Issue
Section
License
All of the works published are licensed under a CC BY 4.0 Creative Commons Attribution license. (updated on March 1st 2021)
The content of the journal may be shared in any material or format. The content may be adapted, contributed upon and transformed. Both possibilities are only permitted in so far as they complete the following conditions:
- Attribution: Credit must be given where it is due, a link to the license must be provided and changes, if made, must be indicated. This should be done in the manner deemed appropriate, without suggesting that the licensor promotes you or your use of the material.
Ownership rights
The patrimonial rights for Contratexto are published under a Creative Commons BY 4.0 license, allowing authors to keep the patrimonial rights to their work without restrictions.
If a work published in Contratexto were to be copied, distributed, spread, or any other activities contemplated in the aforementioned license, the author(s) and the journal must be mentioned visibly and expressly.
Self-archive
This journal allows and encourages authors to post items submitted to the journal on personal websites or institutional repositories both prior to and after publication, while providing bibliographic details that credit, if applicable, its publication in this journal.