back to list

DIW Graduate Center Masterclass by Timm Teubner (TU Berlin)

DatesMay 05, 2021 - May 06, 2021
Date Details
May 05, 2021 Open Details
Time10:00–12:00
Location
Online
May 06, 2021 Open Details
Time10:00–12:00
Location
Online

DIW Graduate Center Masterclass by Timm Teubner (TU Berlin)

Topic:Web Crawling

In this beginner’s guide to web crawling, we will cover the basics of how to automatically extract information from static and dynamic websites. The course will include a fair share of “hands on” work, in which we will write and run code ourselves (Java). If you plan to code along (highly recommended), please have a Java IDE ready for the workshop (e.g. using Eclipse). Beyond the basic principles of how to access websites, we will learn how to navigate the retrieved HTML code (JSoup), how to deal with dynamic and interactive pages (Selenium), and consider some important legal aspects.

Course information can also be found on the GC Masterclass website.

If you are interested in attending the course, please send an email to Juliane Metzner (jmetzner(at)diw.de) for registration.
Once you have registered you will receive a link/invitation for the online class.

Go to speaker website.