PlaywrightCrawler extract_links does not respect strategy

I was crawling a test site that I have hosted locally (localhost).

My PlaywrightCrawler subclass must add additional `user_data` to each request, so I extract and form each request manually.

```
new_requests = await context.extract_links(strategy='same-origin')
for new_request in new_requests:
        print(f"[LINK] Extracted link: {new_request.url}")
```


I start my crawl pointed at `http://localhost`, yet the crawler ends up crawling YouTube since there is a link to YouTube on my site.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PlaywrightCrawler extract_links does not respect strategy #1212

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

PlaywrightCrawler extract_links does not respect strategy #1212

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions