
Scrapy: setting authorization header for proxy in middleware

I am trying to send Scrapy requests through a proxy that requires authorization. I updated the process_request method of the default middleware (middleware.py).

I tried several ways to achieve this, but every time I get the following error message: ERROR: Gave up retrying <GET https://api.ipify.org/> (failed 3 times): Could not open CONNECT tunnel with proxy proxy_ip:proxy_port [{'status': 407, 'reason': b'Proxy Authentication Required'}]

Here is what I tried:

def process_request(self, request, spider):           
    request.meta['proxy'] = 'http://proxy_ip:proxy_port'
    proxy_user_pass = "username:password"
    encoded_user_pass = base64.encodestring(proxy_user_pass.encode()).decode()
    request.headers['Proxy-Authorization'] = 'Basic ' + encoded_user_pass
    return None

I also tried other ways of encoding the header, such as:

From: https://www.zyte.com/blog/scrapy-proxy/

request.headers['Proxy-Authorization'] = basic_auth_header("username", "password")

From: https://github.com/aivarsk/scrapy-proxies/blob/master/scrapy_proxies/randomproxy.py

encoded_user_pass = base64.b64encode(proxy_user_pass.encode()).decode()

From: Scrapy cookies not working when sending Proxy-Authorization header

request.headers['Proxy-Authorization'] = 'Basic ' + encoded_user_pass.strip()
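For reference, the base64 variants above can be compared outside Scrapy. This is only a minimal sketch using the placeholder credentials from the question. Since base64.encodestring was a deprecated alias of base64.encodebytes and was removed in Python 3.9, encodebytes stands in for it here. The variable names with_newline and without_newline are illustrative, not part of any library.

```python
import base64

# Placeholder credentials, as in the question.
proxy_user_pass = "username:password"

# encodebytes (the function encodestring used to alias) appends a trailing
# newline to the encoded value.
with_newline = base64.encodebytes(proxy_user_pass.encode()).decode()

# b64encode returns the encoded value with no trailing newline.
without_newline = base64.b64encode(proxy_user_pass.encode()).decode()

# repr() makes the trailing newline visible.
print(repr(with_newline))
print(repr(without_newline))

# After strip(), both variants yield the same header value.
print(with_newline.strip() == without_newline)
```

If the header is built from the encodebytes result, the newline ends up inside the header value unless it is stripped, which is what the .strip() variant addresses.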

The username and password have been tested and work properly. If I whitelist my current IP (no authorization required), I can send a request using only request.meta['proxy'] = 'http://proxy_ip:proxy_port'. However, this is not a solution, as I do not control the IP from which the request is sent.

Any idea what is going wrong with my request?
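One variant not listed among the attempts above is to put the credentials in the proxy URL itself and set no header by hand. The sketch below rests on two assumptions that the question does not confirm: that the installed Scrapy version's built-in HttpProxyMiddleware extracts credentials from request.meta['proxy'], and that this custom middleware is ordered to run before it in DOWNLOADER_MIDDLEWARES. The names build_proxy_url and ProxyCredentialsMiddleware are hypothetical, and the host, port and credentials are the placeholders from the question.

```python
from urllib.parse import quote


def build_proxy_url(host, port, username, password):
    """Return an http proxy URL with percent-encoded credentials embedded."""
    # Percent-encode so that characters such as '@' or ':' inside the
    # credentials cannot be mistaken for URL delimiters.
    user = quote(username, safe="")
    pwd = quote(password, safe="")
    return f"http://{user}:{pwd}@{host}:{port}"


class ProxyCredentialsMiddleware:
    """Hypothetical downloader middleware that only sets request.meta['proxy']."""

    def process_request(self, request, spider):
        # Assumption: the built-in proxy middleware runs after this one and
        # derives the Proxy-Authorization header from the URL credentials.
        request.meta["proxy"] = build_proxy_url(
            "proxy_ip", "proxy_port", "username", "password"
        )
        # Returning None lets the request continue through the other middlewares.
        return None
```

Whether this removes the 407 depends on the Scrapy version and the middleware order, so it is something to test rather than a confirmed fix.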



source https://stackoverflow.com/questions/73561410/scrapy-setting-authorization-header-for-proxy-in-middleware
