Skip to content

Scrape Data Activity

The Scrape Data activity enables the bulk retrieval of information from a browser or application. It captures structured data and stores it in a designated dataset for further processing.

Field Description Requirement
Application name Specifies the reference name of an already opened or attached application. Required
Scraper name Defines the name of the dataset where the retrieved data will be stored. Required
Table Provides the XPath locator for the target table. Required
Header Specifies the XPath for the table's header section. Leave blank if no header exists. Optional
First record header When selected, designates the first scraped record as a header or title. Optional
Row Defines the XPath that represents all rows within the table. Required
Column Specifies the XPath for individual cells or repetitive data elements within the table rows. Required
Advertisement Provides the XPath locator for an advertisement button or pop-up, if present. Optional
Wait (ms) Sets the waiting period in milliseconds during the scraping process. Optional
Next page Specifies the XPath locator for the "next page" button, if the table spans multiple pages. This XPath should not be present on the last page. Optional
Initial popup click Provides the XPath locator for an initial pop-up's click action. Optional
Popup Specifies the XPath locator for the close button of a pop-up on the page. Optional
Max count Defines the maximum number of data records to be scraped. Optional
Hover When set to True, the page scrolls down after each data scrape. Optional

Action Types & Examples

Application name

  • Format: string
  • Example Result: robustaWebpage

Scraper name

  • Format: string
  • Example Result: scrapeTable

Table

  • Format: string (XPath)
  • Example Result: (//table[@class="unitTable"])[1]
  • Format: string (XPath)
  • Example Result: thead//tr//th

Row

  • Format: string (XPath)
  • Example Result: tbody//tr

Column

  • Format: string (XPath)
  • Example Result: td
  • Format: string (XPath)
  • Example Result: //*[@value="Adv"]/[text()]

Wait (ms)

  • Format: digit
  • Example Result: 3000

Next page

  • Format: string (XPath)
  • Example Result: //*[@value="Next"]/[text()]

Initial popup click

  • Format: string (XPath)
  • Example Result: @class='siebui-popup-button']/*[@title='Pick Address:New']
  • Format: string (XPath)
  • Example Result: @class='siebui-popup-button']/*[@title='Pick Address:New']

Max count

  • Format: digit
  • Example Result: 10

Hover

  • Format: boolean
  • Example Result: true

Implementation Examples

Field Setup - Application name: robustaWebpage - Scraper name: scrapeTable - Table: (//table[@class="unitTable"])[1] - Header: thead//tr//th - Row: tbody//tr - Column: td - Advertisement: //*[@value="Adv"]/[text()] - Next page: //*[@value="Next"]/[text()] - Initial popup click: @class='siebui-popup-button']/*[@title='Pick Address:New'] - Popup: @class='siebui-popup-button']/*[@title='Pick Address:New'] - Hover: true

Execution Parameters - Wait (ms): 3000 - Max count: 10

Technical Notes

For detailed guidance on utilizing the scrape function with WinSpy, refer to the WinSpy Usage on Web Browser documentation.