EMBEDDIA Hackashop, a recapitulation

On April 19, we wrapped up the EACL Hackashop on News Media Content Analysis and Automated Report Generation. The aim of Hackashop 2021 was to foster discussion and research on the combination of language technology and news media content. It provided a forum both for discussing scientific advances in the analysis of news stories and their reader comments and in the automated generation of reports, and for experimental work on identifying interesting phenomena in reader comments and reporting on them.

The hackashop was implemented in a dual format. A traditional track consisted of the submission of scientific papers, their reviews, and finally paper presentations. It was complemented by an active, experimentation-based track consisting of an online hackathon preceding the workshop, with the results presented in the joint workshop event. Both tracks shared the same topic, news media analysis and generation, and the participants in the two tracks overlapped considerably.

In the workshop track, we encouraged submissions of long and short papers. Based on three expert reviews for each submission, weighing the contributions of the submission against its length, 13 papers were selected for presentation in the workshop event.

The online hackathon was organized over a three-week period in February 2021, with six participating teams. The challenges they addressed covered a broad range, as each team had the freedom to define its own aims. In the spirit of providing a joint forum for discussing both scientific advances and experimental work, five hackathon teams submitted short reports to be included in the proceedings.

We were very happy to see several cross-disciplinary and cross-sector collaborations involving, e.g., computer scientists, social scientists, and the media industry, both in workshop papers and hackathon contributions. We were also happy to have numerous contributions that address multilingual settings and low-resource languages.

The workshop event on 19 April 2021 brought both tracks together, with presentations of both scientific workshop papers and empirical hackathon reports. We concluded the Hackashop with an excellent presentation by our keynote speaker, Professor Neil Maiden.

We would once again like to thank all workshop paper authors and hackathon participants for their contributions to the hackashop! We are thankful to the programme committee members for their insightful reviews of the workshop papers. We are equally thankful to the large number of experts who made tools, models, data, and challenges available for the hackathon and provided support for the participants.

Authors: Hannu Toivonen and Michele Boggia

EACL Hackashop on 19 April 2021: programme and proceedings now available

The EACL Hackashop on News Media Content Analysis and Automated Report Generation will have its main event on 19 April 2021, in conjunction with EACL. A pre-print of the proceedings is available from the hackashop home page, with 13 peer-reviewed workshop papers, five hackathon reports and a resource description paper.

The programme will contain a keynote and a roundtable with invited experts, in addition to the contributed poster spotlights and presentations.

Registration for the hackashop is included in the registration for EACL, i.e., there is no separate workshop/hackashop registration. Early registration ends April 7th.

EMBEDDIA Hackathon wrap-up

On Friday, February 19, we wrapped up the EMBEDDIA hackathon. In an online event, the hackathon participants presented their results and their views on the EMBEDDIA tools and identified challenges.

We would like to extend our gratitude to the hackathon participants and the EMBEDDIA staff for making the hackathon a success. It was a very nice opportunity for the EMBEDDIA consortium to see the tools we developed being used outside the consortium for similar or newly identified NLP challenges.

Below are snapshots of the wrap-up meeting.

EMBEDDIA Hackashop: Hackathon halfway get-together

On February 10, the EMBEDDIA consortium organized a get-together of hackathon participants and EMBEDDIA staff. We used this event to check in with the teams and present the expectations and challenges of our media partner, the Finnish News Agency (STT).

The interaction with hackathon teams took place in the Gather.town application, in the form of a tool/model/data/challenge support session. We chose Gather.town to make the event less formal and more social. Participants were able to wander around, meet other participants and see what they were working on, or chat with other researchers from EMBEDDIA!

Below are snapshots of today’s event.

Kick-off of the Hackashop on news media content analysis and automated report generation

Today the EMBEDDIA consortium officially kicked off the Hackashop on news media content analysis and automated report generation. Project partners presented the projects, challenges, and data to be used in the course of the hackashop. Due to the pandemic, the hackashop will be an online event. The hackathon part of the hackashop will run from February 1 to 21, 2021.

Below are some snapshots from today’s event.

2nd call: Hackashop on news media content analysis and automated report generation

The EMBEDDIA consortium is happy to publish updated calls for the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021:

Call for workshop papers: paper submission deadline Jan 31 (extended)
Call for online hackathon participation: registration by Jan 29, event on Feb 1-21

The hackashop implements a novel, dual format: (1) a traditional track with paper submissions, reviews and paper presentations, and (2) an active, experimentation-based track where hackathon-type online activities precede the workshop, and hackathon teams/individuals present their work in the workshop. Datasets and a suite of tools for use in the hackathon are provided by the EMBEDDIA consortium.

Hackashop on news media content analysis and automated report generation – Call for hackathon participation

The EMBEDDIA consortium is happy to issue the call for hackathon participation in the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021.

The call for hackathon participation is available here.

The hackathon targets anyone interested in Natural Language Processing and Machine Learning and will provide access to relevant tools and models from ongoing research, as well as datasets and support from technical experts.

Due to Covid-19, the hackathon is organised as a virtual event over a period of three weeks, Feb 1 – Feb 21, 2021. A special feature of the hackashop is that teams with completed hackathon projects are invited to submit a brief report (approx. 2 pages) to the hackashop workshop proceedings, to be published by EACL, and to present their project briefly (5-10 min) in the workshop event.

The call for peer-reviewed workshop papers for the hackashop has been previously published here.

Hackashop on news media content analysis and automated report generation – Call for workshop papers

The EMBEDDIA consortium is proud to announce the organization of the Hackashop on news media content analysis and automated report generation in conjunction with EACL 2021.

The call for workshop papers is now published; more details are available here.

We welcome work broadly in the area of natural language processing of news media, addressing a range of needs, from readers who consume news of personal interest to journalists who keep track of what is going on in the world, try to understand what their readers think of various topics, or want to automate routine reporting.

Workshop on modern NLP through large pre-trained language models

EMBEDDIA partners from the Faculty of Computer and Information Science (University of Ljubljana) organized a workshop on modern NLP through large pre-trained language models on September 29th, 2020 in Ljubljana, Slovenia.

The workshop was primarily aimed at data scientists (academics, professionals, or students) who know some Python programming and want to learn the basics of modern natural language processing. It was taught by EMBEDDIA’s technical manager Marko Robnik-Šikonja and covered the following subjects:

  • text preprocessing,
  • text representations,
  • basics of neural networks for text processing,
  • neural language models,
  • BERT and transformers,
  • hands-on (a downstream task with transformers): sentiment analysis, named entity recognition, text generation, etc.