EACL Hackashop on 19 April 2021: programme and proceedings now available

The EACL Hackashop on News Media Content Analysis and Automated Report Generation will have its main event on 19 April 2021, in conjunction with EACL. A pre-print of the proceedings is available from the hackashop home page, with 13 peer-reviewed workshop papers, five hackathon reports and a resource description paper.

The programme will contain a keynote and a roundtable with invited experts, in addition to the contributed poster spotlights and presentations.

Registration to the hackashop is included in the registration for EACL, i.e., there is no separate workshop/hackashop registration. Early registration ends April 7th.

EMBEDDIA Hackathon wrap-up

On Friday, February 19, we wrapped up the EMBEDDIA hackathon. In an online event, the hackathon participants presented their results and their views on the EMBEDDIA tools and identified challenges.

We like to extend our gratitude to the hackathon participants and the EMBEDDIA staff for making the hackathon a success. It was a very nice opportunity for the EMBEDDIA consortium to see our developed tools being utilized outside of the consortium for similar or newly identified NLP challenges.

Below are snapshots of the wrap-up meeting.

EMBEDDIA Hackashop: Hackathon halfway get-together

On February 10, the EMBEDDIA consortium organized a hackathon get-together of hackathon participants and EMBEDDIA staff. We used this event to check-in with the teams and present the expectations and challenges of our media partner, the Finnish News Agency (STT).

The interaction with hackathon teams was conducted via the Gather.town application and it was in the form of a tool/model/data/challenge support session. We used the Gather.town application to make the event less formal and more social. Participants were able to wander around and meet other participants and see what they are working on — or to chat with other researchers from EMBEDDIA!

Below are snapshots of today’s event.

Kick-off of the Hackashop on news media content analysis and automated report generation

Today the EMBEDDIA consortium officially kicked-off the Hackashop on news media content analysis and automated report generation. Project partners presented the projects, challenges, and data to be used in the course of the hackashop. Due to the pandemic, the hackashop will be an online event. The hackathon part of the hackashop will run from February 1-21, 2021.

Below are some snapshots from today’s event.

2nd call: Hackashop on news media content analysis and automated report generation

The EMBEDDIA consortium is happy to publish updated calls for the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021:

Call for workshop papers: paper submission deadline Jan 31 (extended)
Call for online hackathon participation:
registration by Jan 29, event on Feb 1-21

The hackashop implements a novel, dual format: (1) a traditional track with paper submissions, reviews and paper presentations, and (2) an active, experimentation-based track where hackathon-type online activities precede the workshop, and hackathon teams/individuals present their work in the workshop. Datasets and a suite of tools for use in the hackathon are provided by the EMBEDDIA consortium.

Hackashop on news media content analysis and automated report generation – Call for hackathon participation

The EMBEDDIA consortium is happy to issue the call for hackathon participation in the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021.

The call for hackathon participation is available here.

The hackathon targets anyone interested in Natural Language Processing and Machine Learning and will provide access to relevant tools and models from ongoing research, as well as datasets and support from technical experts.

Due to Covid-19, the hackathon is organised as a virtual event over a period of three weeks, Feb 1 – Feb 21, 2021. A specialty of the hackashop is that completed hackathon projects are invited to submit a brief report (appr. 2 pages) to the hackashop workshop proceedings, to be published by EACL, and to present their project briefly (5-10 min) in the workshop event.

The call for peer-reviewed workshop papers for the hackashop has been previously published here.

Hackashop on news media content analysis and automated report generation – Call for workshop papers

The EMBEDDIA consortium is proud to announce the organization of the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021.

The Call for workshop papers is now published — more details are available here.

We welcome work broadly in the area of natural language processing of news media, addressing the various needs from the readers who consume news of their personal interest to journalists who keep track of what is going on in the world, try to understand what their readers think of various topics, or want to automate routine reporting.

Workshop on modern NLP through large pre-trained language models

EMBEDDIA partners from the Faculty of Computer and Information Science (University of Ljubljana) organized a workshop on modern NLP through large pre-trained language models on September 29th, 2020 in Ljubljana, Slovenia.

The workshop was primarily aimed at data scientists (academics, professionals, or students) that know some programming in Python and want to learn the basics of modern natural language processing. It was instructed by EMBEDDIA’s technical manager Marko Robnik-Šikonja and touched on the following subjects:

  • text preprocessing,
  • text representations,
  • basics of neural networks for text processing,
  • neural language models,
  • BERT and transformers,
  • hands-on (a downstream task with transformers): sentiment analysis, named entity recognition, text generation, etc.

EMBEDDIA tools standing out on international challenges

The EMBEDDIA team is glad to announce our tools are performing great at international challenges. 

The results in multilingual and social information of our semantic enrichment tools recently outperformed all other participants in the official rankings in all languages in: 

HIPE (Identifying Historical People, Places and other Entities) is a evaluation campaign on named entity processing on historical newspapers in French, German and English, organized in the context of the impresso project and run as a CLEF 2020 Evaluation Lab.

FinNum is a task for fine-grained numeral understanding in financial social media data – to identify the linking between the target cashtag and the target numeral.

Also! Our multilingual fake news spreader model (in English and Spanish) came out third (out of 66 participants) at this year’s PAN. You can find out more about the fake news spreader model on this link