Detalhes do pacote

@s524797336/urllib

s5247973366MIT1.0.0

Help in opening URLs (mostly HTTP) in a complex world — basic and digest authentication, redirections, cookies and more.

urllib, http, urlopen, curl

readme (leia-me)

urllib

NPM version build status appveyor build status Test coverage David deps Known Vulnerabilities npm download

Request HTTP URLs in a complex world — basic and digest authentication, redirections, cookies, timeout and more.

Install

$ npm install urllib --save

Usage

callback

var urllib = require('urllib');

urllib.request('http://cnodejs.org/', function (err, data, res) {
  if (err) {
    throw err; // you need to handle error
  }
  console.log(res.statusCode);
  console.log(res.headers);
  // data is Buffer instance
  console.log(data.toString());
});

Promise

If you've installed bluebird, bluebird will be used. urllib does not install bluebird for you.

Otherwise, if you're using a node that has native v8 Promises (v0.11.13+), then that will be used.

Otherwise, this library will crash the process and exit, so you might as well install bluebird as a dependency!

var urllib = require('urllib');

urllib.request('http://nodejs.org').then(function (result) {
  // result: {data: buffer, res: response object}
  console.log('status: %s, body size: %d, headers: %j', result.res.statusCode, result.data.length, result.res.headers);
}).catch(function (err) {
  console.error(err);
});

co & generator

If you are using co or koa:

var co = require('co');
var urllib = require('urllib');

co(function* () {
  var result = yield urllib.requestThunk('http://nodejs.org');
  console.log('status: %s, body size: %d, headers: %j',
    result.status, result.data.length, result.headers);
})();

Global response event

You should create a urllib instance first.

var httpclient = require('urllib').create();

httpclient.on('response', function (info) {
  error: err,
  ctx: args.ctx,
  req: {
    url: url,
    options: options,
    size: requestSize,
  },
  res: res
});

httpclient.request('http://nodejs.org', function (err, body) {
  console.log('body size: %d', body.length);
});

API Doc

Method: http.request(url[, options][, callback])

Arguments

  • url String | Object - The URL to request, either a String or a Object that return by url.parse.
  • options Object - Optional
    • method String - Request method, defaults to GET. Could be GET, POST, DELETE or PUT. Alias 'type'.
    • data Object - Data to be sent. Will be stringify automatically.
    • dataAsQueryString Boolean - Force convert data to query string.
    • content String | Buffer - Manually set the content of payload. If set, data will be ignored.
    • stream stream.Readable - Stream to be pipe to the remote. If set, data and content will be ignored.
    • writeStream stream.Writable - A writable stream to be piped by the response stream. Responding data will be write to this stream and callback will be called with data set null after finished writing.
    • consumeWriteStream [true] - consume the writeStream, invoke the callback after writeStream close.
    • contentType String - Type of request data. Could be json. If it's json, will auto set Content-Type: application/json header.
    • nestedQuerystring Boolean - urllib default use querystring to stringify form data which don't support nested object, will use qs instead of querystring to support nested object by set this option to true.
    • dataType String - Type of response data. Could be text or json. If it's text, the callbacked data would be a String. If it's json, the data of callback would be a parsed JSON Object and will auto set Accept: application/json header. Default callbacked data would be a Buffer.
    • fixJSONCtlChars Boolean - Fix the control characters (U+0000 through U+001F) before JSON parse response. Default is false.
    • headers Object - Request headers.
    • timeout Number | Array - Request timeout in milliseconds for connecting phase and response receiving phase. Defaults to exports.TIMEOUT, both are 5s. You can use timeout: 5000 to tell urllib use same timeout on two phase or set them seperately such as timeout: [3000, 5000], which will set connecting timeout to 3s and response 5s.
    • auth String - username:password used in HTTP Basic Authorization.
    • digestAuth String - username:password used in HTTP Digest Authorization.
    • agent http.Agent - HTTP Agent object. Set false if you does not use agent.
    • httpsAgent https.Agent - HTTPS Agent object. Set false if you does not use agent.
    • ca String | Buffer | Array - An array of strings or Buffers of trusted certificates. If this is omitted several well known "root" CAs will be used, like VeriSign. These are used to authorize connections. Notes: This is necessary only if the server uses the self-signed certificate
    • rejectUnauthorized Boolean - If true, the server certificate is verified against the list of supplied CAs. An 'error' event is emitted if verification fails. Default: true.
    • pfx String | Buffer - A string or Buffer containing the private key, certificate and CA certs of the server in PFX or PKCS12 format.
    • key String | Buffer - A string or Buffer containing the private key of the client in PEM format. Notes: This is necessary only if using the client certificate authentication
    • cert String | Buffer - A string or Buffer containing the certificate key of the client in PEM format. Notes: This is necessary only if using the client certificate authentication
    • passphrase String - A string of passphrase for the private key or pfx.
    • ciphers String - A string describing the ciphers to use or exclude.
    • secureProtocol String - The SSL method to use, e.g. SSLv3_method to force SSL version 3.
    • followRedirect Boolean - follow HTTP 3xx responses as redirects. defaults to false.
    • maxRedirects Number - The maximum number of redirects to follow, defaults to 10.
    • formatRedirectUrl Function - Format the redirect url by your self. Default is url.resolve(from, to).
    • beforeRequest Function - Before request hook, you can change every thing here.
    • streaming Boolean - let you get the res object when request connected, default false. alias customResponse
    • gzip Boolean - Accept gzip response content and auto decode it, default is false.
    • timing Boolean - Enable timing or not, default is false.
    • enableProxy Boolean - Enable proxy request, default is false.
    • proxy String | Object - proxy agent uri or options, default is null.
  • callback(err, data, res) Function - Optional callback.
    • err Error - Would be null if no error accured.
    • data Buffer | Object - The data responsed. Would be a Buffer if dataType is set to text or an JSON parsed into Object if it's set to json.
    • res http.IncomingMessage - The response.

Returns

http.ClientRequest - The request.

Calling .abort() method of the request stream can cancel the request.

Options: options.data

When making a request:

urllib.request('http://example.com', {
  method: 'GET',
  data: {
    'a': 'hello',
    'b': 'world'
  }
});

For GET request, data will be stringify to query string, e.g. http://example.com/?a=hello&b=world.

For others like POST, PATCH or PUT request, in defaults, the data will be stringify into application/x-www-form-urlencoded format if Content-Type header is not set.

If Content-type is application/json, the data will be JSON.stringify to JSON data format.

Options: options.content

options.content is useful when you wish to construct the request body by yourself, for example making a Content-Type: application/json request.

Notes that if you want to send a JSON body, you should stringify it yourself:

urllib.request('http://example.com', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json'
  },
  content: JSON.stringify({
    a: 'hello',
    b: 'world'
  })
});

It would make a HTTP request like:

POST / HTTP/1.1
Host: example.com
Content-Type: application/json

{
  "a": "hello",
  "b": "world"
}

This exmaple can use options.data with application/json content type:

urllib.request('http://example.com', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json'
  },
  data: {
    a: 'hello',
    b: 'world'
  }
});

Options: options.stream

Uploads a file with formstream:

var urllib = require('urllib');
var formstream = require('formstream');

var form = formstream();
form.file('file', __filename);
form.field('hello', '你好urllib');

var req = urllib.request('http://my.server.com/upload', {
  method: 'POST',
  headers: form.headers(),
  stream: form
}, function (err, data, res) {
  // upload finished
});

Response Object

Response is normal object, it contains:

  • status or statusCode: response status code.
    • -1 meaning some network error like ENOTFOUND
    • -2 meaning ConnectionTimeoutError
  • headers: response http headers, default is {}
  • size: response size
  • aborted: response was aborted or not
  • rt: total request and response time in ms.
  • timing: timing object if timing enable.
  • remoteAddress: http server ip address
  • remotePort: http server ip port

Response: res.aborted

If the underlaying connection was terminated before response.end() was called, res.aborted should be true.

require('http').createServer(function (req, res) {
  req.resume();
  req.on('end', function () {
    res.write('foo haha\n');
    setTimeout(function () {
      res.write('foo haha 2');
      setTimeout(function () {
        res.socket.end();
      }, 300);
    }, 200);
    return;
  });
}).listen(1984);

urllib.request('http://127.0.0.1:1984/socket.end', function (err, data, res) {
  data.toString().should.equal('foo haha\nfoo haha 2');
  should.ok(res.aborted);
  done();
});

HttpClient2

HttpClient2 is a new instance for future. request method only return a promise, compatible with async/await and generator in co.

Options

options extends from urllib, besides below

  • retry Number - a retry count, when get an error, it will request again until reach the retry count.
  • retryDelay Number - wait a delay(ms) between retries.
  • isRetry Function - determine whether retry, a response object as the first argument. it will retry when status >= 500 by default. Request error is not included.

Proxy

Support both http and https protocol.

Notice: Only support on Node.js >= 4.0.0

Programming

urllib.request('https://twitter.com/', {
  enableProxy: true,
  proxy: 'http://localhost:8008',
}, (err, data, res) => {
  console.log(res.status, res.headers);
});

System environment variable

  • http
HTTP_PROXY=http://localhost:8008
http_proxy=http://localhost:8008
  • https
HTTP_PROXY=http://localhost:8008
http_proxy=http://localhost:8008
HTTPS_PROXY=https://localhost:8008
https_proxy=https://localhost:8008
$ http_proxy=http://localhost:8008 node index.js

TODO

  • [ ] Support component
  • [ ] Browser env use Ajax
  • [√] Support Proxy
  • [√] Upload file like form upload
  • [√] Auto redirect handle
  • [√] https & self-signed certificate
  • [√] Connection timeout & Response timeout
  • [√] Support Accept-Encoding=gzip by options.gzip = true
  • [√] Support Digest access authentication

License

MIT

changelog (log de mudanças)

2.25.1 / 2017-10-20

fixes

others

2.25.0 / 2017-09-08

features

2.24.0 / 2017-07-31

  • feat: support http(s) proxy (#226)

2.23.0 / 2017-07-18

  • test: skip test.webdav.org test cases
  • feat: add defaultArgs on HttpClient

2.22.0 / 2017-04-10

  • feat: add options.nestedQuerystring (#254)

2.21.2 / 2017-03-19

  • fix: don't listen response aborted on node > 0.12 (#252)

2.21.1 / 2017-03-16

  • fix: throw when write to stream timeout (#251)

2.21.0 / 2017-02-27

  • fix: should pass options to httpclient2 (#249)
  • test: fix Promise not defined on 0.10
  • test: use assert instead of should
  • feat: add retry delay on httpclient2

2.20.0 / 2017-02-06

  • deps: bump deps versions
  • fix: keep the same req object across request and response event

2.19.0 / 2016-12-14

  • feat: add dataAsQueryString params for convert data to query string (#240)

2.18.0 / 2016-12-07

  • fix: use nextTick to prevent promise handling error.
  • refactor: move to separated files
  • feat: add retry option

2.17.1 / 2016-11-25

  • add environment detection for connect timer, because no socket event in browser env (#236)

2.17.0 / 2016-10-13

  • feat: add -2 status for connect timeout (#224)

2.16.1 / 2016-10-10

  • fix: parse content-type (#221)

2.16.0 / 2016-09-27

  • feat: add custom dns lookup function (#220)

2.15.1 / 2016-09-26

  • fix: httpclient support set agent to false (#219)

2.15.0 / 2016-09-21

  • feat: export remoteAddress and remotePort (#216)

2.14.0 / 2016-09-19

  • feat: allow user to rewrite redirect url (#214)

2.13.2 / 2016-09-18

  • fix: response size should use last one (#213)

2.13.1 / 2016-09-10

  • fix: add missing ctx on request event (#210)

2.13.0 / 2016-08-09

  • feat: timing (#204)
  • docs: fix res.aborted description

2.12.0 / 2016-08-08

  • feat: support connect and response timeouts (#201)

2.11.1 / 2016-08-04

  • fix: catch http.request sync error (#199)

2.11.0 / 2016-06-26

  • deps: upgrade deps from ~ to ^ (#189)

2.10.0 / 2016-06-21

  • feat: add an options consumeWriteStream (#187)
  • chore(package): update statuses to version 1.3.0 (#174)

2.9.1 / 2016-05-09

  • fix: check url before request (#172)
  • chore(package): update any-promise to version 1.2.0 (#171)

2.9.0 / 2016-04-21

  • feat: log all requested urls (#169)
  • deps: agentkeepalive@2.1.1

2.8.0 / 2016-02-27

  • test: improve coverage
  • feat: http default protocol for URL argument

2.7.3 / 2016-02-27

  • deps: upgrade out of date deps

2.7.2 / 2016-02-25

  • test: support windows
  • fix: keep headers.Host on location: /foo redirect
  • test: use npmjs.com on travis ci
  • fix: jshint style
  • deps: any-promise instead of native-or-blubird

2.7.1 / 2016-02-02

  • fix: clean up headers.Host before redirect request start
  • chore: update authors

2.7.0 / 2016-01-14

  • feat: response event include data property
  • chore: Add host info into debug

2.6.0 / 2015-12-09

  • test: fix unstable test cases
  • feat: enhance global events
  • chore(package): update semver to version 5.1.0
  • chore(package): update should to version 7.1.1

2.5.0 / 2015-09-30

  • test: fix test url
  • feat: remove request# in error message
  • test: add streaming upload test
  • test: use codecov.io

2.4.0 / 2015-08-20

  • feat: add options.fixJSONCtlChars to fix JSON control characters
  • Fix a typo in comment

2.3.11 / 2015-08-12

  • fix: httpclient support curl too

2.3.10 / 2015-08-12

  • fix: add alias urllib.curl()
  • chore: add decodeBodyByCharset error debug log

2.3.9 / 2015-07-23

  • feat: show json format data when json parse error

2.3.8 / 2015-06-06

  • fix: need to clear timer after follow redirect

2.3.7 / 2015-06-04

  • test: use cnpmjs.org instead of taobao.com
  • fix: need to resume res before next redirect request start

2.3.6 / 2015-06-03

  • fix: support 303, 305, 307 redirect status code

2.3.5 / 2015-05-11

  • fix: followRedirect support customResponse.

2.3.4 / 2015-04-19

  • feat: show agent status message when request error

2.3.3 / 2015-03-30

  • fix: add ciphers and secureProtocol params support for https request

2.3.2 / 2015-03-29

  • refactor: httpclient custom agent property

2.3.1 / 2015-03-08

  • fix: auto decode gzip content

2.3.0 / 2015-02-16

  • feat: mark off connection state and response state

2.2.2 / 2015-01-21

  • remove unuse event handlers

2.2.1 / 2014-12-10

  • refactor and add more comments
  • add path to error (@coderhaoxin)
  • fix promise example in readme

2.2.0 / 2014-11-28

  • add customResponse option (@fishbar)

2.1.0 / 2014-11-15

  • humanize timeout

2.0.2 / 2014-11-01

  • chore: bump deps version and make test more stable
  • refactor: dont add new property on res object

2.0.1 / 2014-10-15

  • add args.contentType option (@coderhaoxin)
  • Simply the HTTPClient implementation (@JacksonTian)
  • refine urllib code (@JacksonTian)

2.0.0 / 2014-10-13

  • support auto decode charset when dataType set

1.5.2 / 2014-09-15

  • do not check ssl, fix hang up in some node version

1.5.1 / 2014-09-10

  • httpclient add requestThunk()

1.5.0 / 2014-09-10

  • add requestThunk to support co

1.4.1 / 2014-08-28

  • HttpClient support agent and httpsAgent

1.4.0 / 2014-08-27

  • add SocketAssignTimeoutError. #37

1.3.1 / 2014-08-27

  • convert data to string when dataType is text

1.3.0 / 2014-08-26

  • add urllib instance

1.2.1 / 2014-08-26

  • add args.ctx for response event easy logging

1.2.0 / 2014-08-26

  • format Response object fields

1.1.0 / 2014-08-25

  • global response event. fixed #35

1.0.0 / 2014-08-25

  • return Promise when callback missing. fixed #33
  • rm Makefile
  • use flat image

0.5.17 / 2014-08-08

  • Remove aborted. joyent/node#7457
  • missing I in urllib logo

0.5.16 / 2014-05-15

  • fix test cases
  • change .once to .on (@alsotang)

0.5.15 / 2014-05-04

  • make callback is optional. close #29
  • rm 0.8 from travis

0.5.14 / 2014-04-21

  • fix #28 user-agent logic bug

0.5.13 / 2014-03-31

  • use digest-header module

0.5.12 / 2014-03-29

  • support Digest access authentication. fix #27
  • add co-urllib desc

0.5.11 / 2014-03-13

  • improve user-agent, add node version and plaform detail

0.5.10 / 2014-03-11

  • if body not decode, dont touch it

0.5.9 / 2014-03-10

  • Support options.gzip = true to handle gzip response. fixed #26

0.5.8 / 2014-03-07

  • remove buffer-concat

0.5.7 / 2014-03-07

  • no more deps on buffer-concat
  • add default User-Agent: node-urllib/x.x.x
  • add jshint

0.5.6 / 2014-03-05

  • add data/res to error
  • fix typo (@coderhaoxin)
  • access npmjs.org https
  • fix test cases and use autod
  • install from cnpm
  • no more support on node 0.6.x

0.5.5 / 2013-12-10

  • should pass done instead of callback and end the writeStream
  • support args.writeStream with follow redirect (@dead-horse)

0.5.4 / 2013-11-09

  • fix timeout not effect bug

0.5.3 / 2013-10-18

  • add args.beforeRequest(options) hook to change options before http send

0.5.2 / 2013-09-23

  • add JSONResponseFormatError; append request url infomation to err.message

0.5.1 / 2013-08-23

  • detect connect timeout or response timeout fixed #18
  • update doc

0.5.0 / 2013-08-11

  • Support max redirects to protect loop redirect
  • Auto redirect handle (@ibigbug)

0.4.4 / 2013-08-10

  • handle json response to null when data size is zero

0.4.3 / 2013-08-10

  • Auto convert data to json string when content-type is 'json' fixed #15
  • add drone.io status build image

0.4.2 / 2013-08-10

  • fix SELF_SIGNED_CERT_IN_CHAIN test case on node 0.8 and 0.6
  • [√] https & self-signed certificate

0.4.1 / 2013-08-05

  • return RemoteSocketClosedError when Remote socket was terminated before response.end() was called

0.4.0 / 2013-08-05

  • If the underlaying connection was terminated before response.end() was called, res.aborted should be true. fixed #14
  • fixed test case for 0.6
  • add res.socket.end() test cases
  • remove 0.11 from travis

0.3.8 / 2013-08-02

  • add debug log

0.3.7 / 2013-07-11

  • PATCH method is also "application/x-www-form-urlencoded" by default
  • replace logo

0.3.6 / 2013-07-11

  • fixed bug in processing query string #13 (@xingrz)
  • updated readme example (@xingrz)
  • update authors
  • API docs (@xingrz)

0.3.5 / 2013-07-10

  • fixed writeSteam receive incomplete bug
  • update makefile
  • add coveralls
  • remove 0.11 from travis
  • add patch for node 0.6
  • fixed https request timeout tests
  • use blanket instead of jscover

0.3.4 / 2013-03-06

  • fixed #8 auto add application/x-www-form-urlencoded
  • fixed existsSync for node < 0.8

0.3.3 / 2012-12-14

  • support writeStream

0.3.2 / 2012-11-08

  • fixed #4 support urllib.request(options, args, callback)
  • fixed usage demo bug
  • fixed readme

0.3.1 / 2012-11-05

  • fixed #2 support stream and return the req object.
  • use jscover instead of jscoverage

0.3.0 / 2012-10-10

  • add coverage results
  • Bash auth support: http://user:password@http://demo.com .