http-parser

Commit Graph

Author	SHA1	Message	Date
Fedor Indutny	0097de5895	src: use memchr() in h_general header value	10 years ago
Fedor Indutny	c6097e1d76	src: faster general header value loop	10 years ago
Fedor Indutny	263006044a	src: less loads in header_value loop	10 years ago
Fedor Indutny	0cb0ee672c	src: tighten header field/value loops	10 years ago
Fedor Indutny	6132d1fefa	src: save progress	10 years ago
Jeff Pinner	0b43367131	http_parser: Follow RFC-7230 Sec 3.2.4 RFC-7230 Sec 3.2.4 expressly forbids line-folding in header field-names. This change no longer allows obsolete line-folding between the header field-name and the colon. If HTTP_PARSER_STRICT is unset, the parser still allows space characters. Reviewed-By: Fedor Indutny <fedor@indutny.com>	10 years ago
Alexis La Goutte	5b951d74bd	src: fix clang warning Fix http_parser.c:2147:3: warning: Value stored to 'uf' is never read found by Clang Analyser. Reviewed-By: Fedor Indutny <fedor@indutny.com>	10 years ago
Helge Heß	1317eeca43	Added support for MKCALENDAR Signed-off-by: Fedor Indutny <fedor@indutny.com>	11 years ago
Vinnie Falco	24e2d2d43f	Allow HTTP_MAX_HEADER_SIZE to be defined externally	11 years ago
David Wragg	76f0f1690f	Fix issues around multi-line headers Always discard leading whitespace in a header value, even if it is folded. Pay attention to values of interesting headers (Connection, Content-Length, etc.) even when they come on a continuation line. Add a test case to check that requests and responses using only LF to separate lines are handled correctly.	11 years ago
David Wragg	5d9c382172	Include separating ws when folding header values The support for folding of multi-line header values does not conform to the specs. Given a request containing Multi-Line-Header: foo<CRLF> bar<CRLF> http-parser will eliminate the whitespace breaking the header value to yield a header value of "foobar". This is confirmed by the LINE_FOLDING_IN_HEADER case in tests.c. But from rfc2616, section 2.2: A CRLF is allowed in the definition of TEXT only as part of a header field continuation. It is expected that the folding LWS will be replaced with a single SP before interpretation of the TEXT value. And from draft-ietf-httpbis-p1-messaging-25, section 3.2.4: A server that receives an obs-fold in a request message that is not within a message/http container MUST either reject the message by sending a 400 (Bad Request), preferably with a representation explaining that obsolete line folding is unacceptable, or replace each received obs-fold with one or more SP octets prior to interpreting the field value or forwarding the message downstream. So in the example above, the header value should be interpreted as "foo bar", possibly with multiple spaces. The current http-parser behaviour of eliminating the LWS altogether clearly deviates from the specs. For http-parser itself to confirm exactly would involve significant changes in order to synthesize replacement SP octets. Such changes are unlikely to be worth it to support what is an obscure and deprecated feature. But http-parser should at least preserve some separating whitespace when folding multi-line header values, so that applications using http-parser can conform to the specs. This commit is a minimal change to preserve whitespace when folding lines. It eliminates the CRLF, but retains any trailing and leading whitespace in the header value.	11 years ago
Alexis Campailla	a252d4eebc	fix content-length and chunk-size overflow test The overflow check didn't work for all possible inputs.	11 years ago
Patrik Stutz	d7b938bdca	Parse and emit status message of response	11 years ago
Ben Noordhuis	f5c779bb85	Update misleading comment. The HTTP_MAX_HEADER_SIZE check is not there to guard against buffer overflows, it's there to protect unwitting embedders against denial-of-service attacks.	11 years ago
Ben Noordhuis	547553b090	Further request method check strengthening.	11 years ago
Chris Dickinson	c6ee6ada69	Do not accept PUN/GEM methods as PUT/GET. * Encountering them returns an error, `HPE_INVALID_METHOD` * Tests have been added.	11 years ago
Ben Noordhuis	d3264312e1	Add function http_parser_version(). Fixes #115.	11 years ago
Tóth Tamás	0938fe599f	Add on_status_complete callback. Add a "status complete" callback to support Simple-Response handling with HTTP version <= 1.0. Patch by Tóth Tamás, tests by Corey Richardson.	12 years ago
Corey Richardson	1c7f8cac9e	Fix IPv6 address parsing. Fixes #133.	12 years ago
Ben Noordhuis	245f6f0078	Remove HTTP_PARSER_DEBUG macro. Remove the HTTP_PARSER_DEBUG macro for two reasons: * It changes the size of struct http_parser, resulting in spurious memory corruption bugs if part of your application is built with HTTP_PARSER_DEBUG=1 and other parts with HTTP_PARSER_DEBUG=0. * It's a debugging tool for maintainers. It should never have been exposed in the API in the first place.	12 years ago
BogDan Vatra	1ca7de5258	Add "int http_body_is_final(const http_parser *parser)" method. It's useful to check if the current chunk is the last one.	12 years ago
Ben Noordhuis	ad3b631d4f	Turn normal_url_char into a bit array. Makes http_parser slightly more cache friendly.	12 years ago
Ben Noordhuis	add3018ce7	Add bounds check to http_method_str().	12 years ago
Ben Noordhuis	9f92347851	Make http_should_keep_alive() const correct.	12 years ago
Bertrand Paquet	a828edaf6a	Add a comment	13 years ago
Bertrand Paquet	50faa793f4	Coding style : remove space before ++	13 years ago
Bertrand Paquet	148984cd8d	Rename s_req_host* to be compliant with RFC 2396	13 years ago
Bertrand Paquet	7f1b191d6f	Minor speed improvement	13 years ago
Bertrand Paquet	d2ce562338	Use new state instead of pointer	13 years ago
Bertrand Paquet	bb29f43741	Coding style improvement	13 years ago
Bertrand Paquet	f6f761596e	Small refactoring, add edge cases	13 years ago
Bertrand Paquet	7965096276	User info implementation	13 years ago
Bertrand Paquet	ed8475d49f	Refactor host parsing to allow basic auth management	13 years ago
Ben Noordhuis	b97fdb0513	Don't assert() on whitespace in URL. Be lenient about tabs and form feeds in non-strict mode.	13 years ago
Ben Noordhuis	8bec3ea459	Create method_strings array with HTTP_METHOD_MAP macro.	13 years ago
Nathan Rajlich	a3373d7627	add support for "SEARCH" request methods	13 years ago
Ben Noordhuis	62110efe7a	Support PURGE request method. Fixes joyent/node#2775.	13 years ago
David Gwynne	67568421e9	allow extra ? at the beginning of a query_string. fixes joyent/http-parser issue #25	13 years ago
David Gwynne	8da60bc423	implement parsing of v6 addresses and rejection of 0-length host and ports. the v6 parsing works by adding extra states for working with the [] notation for v6 addresses. hosts and ports cannot be 0-length because we url parsing from ending when we expect those fields to begin. http_parser_parse_url gets a free check for the correctness of CONNECT urls (they can only be host:port). this addresses the following issues: i was bored and had my head in this space.	13 years ago
David Gwynne	0499525110	Fix http_parser_parse_url for urls like "http://host/path ". Before this change it would include the last slash in the separator between the schema and host as part of the host. we cant use the trick used for skipping the separator before ports, query strings, and fragments because if it was a CONNECT style url string (host:port) it would skip the first character of the hostname. Work around this by introducing a few more states to represent these separators in a url differently to what theyre separating. this in turn lets us simplify the url parsing so can simply skip what it considers delimiters rather than having to special case certain types of url parts and skip their prefixes. Add tests for the http_parser_parse_url(). This compares the http_parser_url struct that http_parser_parse_url() produces against one that we expect from the test. If they differ then http_parser_parse_url() misbehaved.	13 years ago
Ben Noordhuis	c3153bd1a9	Eat CRLF between requests, even on connection:close. Fixes #47.	13 years ago
Ben Noordhuis	f668e72380	Make content_length unsigned, add overflow checks.	13 years ago
James McLaughlin	03e0d5292a	Use "" instead of <> for the http_parser.h include. This avoids having to specify -I when building.	13 years ago
Ben Noordhuis	3e626c6cb6	Don't use 'inline'. 'inline' is not a recognized C89 keyword, it made the build fail with strict or older compilers (msvc 2008, gcc with -std=c89). 'inline' is also just a hint, one that gcc 4.4.3 in this particular case happily ignored. Ergo, remove it.	13 years ago
Ivo Raisr	2a2f99f9cd	http_parser_init does not clear status_code	13 years ago
Andre Caron	051d6fe219	Fixes build on MSVC.	13 years ago
Peter Griess	eb04bbe1fa	Merge pull request #73 from pgriess/http-10-message-length Get HTTP/1.1 message length logic working for HTTP/1.0	13 years ago
Peter Griess	d0bb867d1b	Implement http_parser_pause(). Summary: - Add http_parser_pause() API. A callback may invoke this at any time. This will cause http_parser_parse() to return indicating that it parsed less than the number of requested bytes and set an error to HBE_PAUSED. A paused parser will fail with HBE_PAUSED until it is un-paused with http_parser_pause(). - Stop using 'state', 'header_state', 'index', and 'nread' shadow variables and then updating their http_parser fields when we're done. Instead, update the live values as we go. This will make it possible to return from anywhere in the parser (say, due to EPAUSED) and have valid/expected state. - Update state before making callbacks so that if they want to pause, we'll know the correct state already. - Make sure that every callback has a state that uniquely identifies the next step so that we can resume in the right place if we were supposed to be paused. - Clean and re-factor up CALLBACK() macros. - Use CALLBACK() macros for (almost) all callbacks; on_headers_complete is still a special case. This includes on_body which we used to invoke manually with a long run of bytes. We now use a 'body' mark and hit its callback just like every other data callback. - Clean up (most) gotos and replace with real states. - Add some unit tests. Fixes #70	13 years ago
Peter Griess	b115d110a3	Don't wait for EOF on 0-length KA messages. - Break EOF handling out of http_should_keep_alive() into http_message_needs_eof(), which we now use when determining what to do with a message of unknown length. This prevents us from falling into the s_body_identity_eof state in the cases where we actually do know the length of the message (e.g. because the response status was 204).	13 years ago
Peter Griess	248fbc3ab4	Get HTTP/1.1 message length logic working for HTTP/1.0 - Port message length logic from #72 to HTTP/1.0. - Add a bunch of unit tests for handling 0-length messages.	13 years ago
Peter Griess	d7675cd9a6	Add http_parser_parse_url(). - Add an http_parser_parse_url() method to parse a URL into its constituent components. This uses the same underlying parser as http_parser_parse() and doesn't do any data copies. - Re-add the URL components in various test.c structures; validate them when parsing.	13 years ago
Peter Griess	48a4364fdd	Remove some chars from tokens[] per RFC. - Treat ' ' specially, as apparently IIS6.0 can send this in headers. Allow this character through if we're not in strict mode. - Move some test code around so that test indices don't break when HTTP_PARSER_STRICT changes. Fixes #13.	13 years ago
koichik	b47c44d7a6	Fix response body is not read With HTTP/1.1, if neither Content-Length nor Transfer-Encoding is present, section 4.4 of RFC 2616 suggests http-parser needs to read a response body until the connection is closed (except the response must not include a body) See also joyent/node#2457. Fixes #72	13 years ago
Felix Geisendörfer	2498961231	Accept HTTP/0.9 responses See joyent/node#1711	13 years ago
Paul Querna	f1d48aa31c	Move all data to before code to fix http parser for c89.	13 years ago
Fouad Mardini	2b2ba2da1a	rename parser->errno to parser->http_errno; conflicts with errno.h where errno is defined as a macro	14 years ago
Peter Griess	53adfacad1	API CHANGE: Remove path, query, fragment CBs. - Get rid of support for these callbacks in http_parser_settings. - Retain state transitions between different URL portions in http_parser_execute() so that we're making the same correctness guarantees as before. - These are being removed because making multiple callbacks for the same byte makes it more difficult to pause the parser.	14 years ago
Peter Griess	49faf2e9cd	Merge pull request #53 from pgriess/callback_noclear Get rid of CALLBACK_NOCLEAR().	14 years ago
Peter Griess	5469827542	Get rid of CALLBACK_NOCLEAR(). - This was only used by CALLBACK() (which then cleared the mark anyway), and the end of the http_parser_execute() body (after which they go out of scope).	14 years ago
Peter Griess	761a5eaeb1	Break out errno into its own field.	14 years ago
Jon Kolb	8153466643	Group POST refinements, test all request methods, make IS_ALPHA use LOWER internally	14 years ago
Peter Griess	9114e58a77	Facility to report detailed parsing errors. - Add http_errno enum w/ values for many parsing error conditions. Stash this in http_parser.state if the 0x80 bit is set. - Report line numbers on error generation if the (new) HTTP_PARSER_DEBUG cpp symbol is set. Increases http_parser struct size by 8 bytes in this case. - Add http_errno_*() methods to help turning errno values into human-readable messages.	14 years ago
Peter Griess	056bcd3672	Merge pull request #49 from pgriess/upgrade-off-by-one Fix off-by-one in handling upgrade bodies.	14 years ago
Peter Griess	d4ca280af5	Fix off-by-one in handling upgrade bodies. - When handling upgraded bodies, http_parser_execute() used to return one fewer bytes parsed than expected. This caused the final LF to be interpreted by the caller as part of the body. - Add a bunch of upgrade body unit tests.	14 years ago
Cliff Frey	d5f0312eee	remove unused LOWER(ch)	14 years ago
Jon Kolb	a6934445e8	Allow uppercase chars in IS_ALPHANUM	14 years ago
Peter Griess	f684abdcc5	Merge pull request #27 from a2800276/master lowercasing in header after check for CR LF	14 years ago
Jon Kolb	dc314a3cb9	Return error when bad method starts with M or C	14 years ago
Sean Cunningham	b89f94414e	Support multi-line folding in header values. Normal value cb is called for subsequent lines. LWS is skipped. Note that \t whitespace character is now supported after header field name. RFC 2616, Section 2.2 "HTTP/1.1 header field values can be folded onto multiple lines if the continuation line begins with a space or horizontal tab. All linear white space, including folding, has the same semantics as SP. A recipient MAY replace any linear white space with a single SP before interpreting the field value or forwarding the message downstream."	14 years ago
Cliff Frey	3258e4a455	Fix build when char is unsigned by default. I tested by building/testing with -funsigned-char. Thanks to apaprocki for pointing out this problem.	14 years ago
Ryan Dahl	eee60127c0	Support PATCH method Requested in https://groups.google.com/forum/#!topic/nodejs-dev/iEOyiDkJRLs	14 years ago
Peter Griess	3bd18a779e	IS_* macros for char classes. - Add IS_ALPHA(), IS_NUM(), IS_HOST_CHAR(), etc. macros for determining membership in a character class. HTTP_PARSER_STRICT causes some of these definitions to change. - Support '_' character in hostnames in non-strict mode. - Support leading digits in hostnames when the method is HTTP_CONNECT. - Don't re-define HTTP_PARSER_STRICT in http_parser.h if it's already defined. - Tweak Makefile to run non-strict-mode unit tests. Rearrange non-strict mode unit tests in test.c. - Add test_fast to .gitignore. Fixes #44	14 years ago
Ryan Dahl	2839784927	HTTP_STRICT ifdefs out behavior introduced in `50b9bec` Fixes #37.	14 years ago
Peter Griess	b1c2cf83fd	Expose F_* flags as public API. Fixes #42.	14 years ago
Ryan Dahl	8dabce6ec7	It was pointed out we're missing attribution to NGINX	14 years ago
Peter Griess	9639c7c21c	Support ?-terminated hostnames per RFC 2396.3.2. - Bust out of s_req_host and s_req_port on '?'. - Add tests for query string parsing. Fixes #38.	14 years ago
Peter Griess	50b9bec552	Allow octets > 127 in path components. - This is non-spec behavior, but it appears that most HTTP servers implicitly support non-ASCII characters when parsing path components. Extend http-parser to allow this. - Fill out slots [128, 256) in normal_url_char[] with 1 so that these high octets are accepted in path components. - Add unit test for paths that include such non-ASCII characters. Fixes #37.	14 years ago
Ryan Dahl	63daf22f2c	Update copyright headers	14 years ago
Sean Cunningham	10270007bc	Avoid chunk header parsing overflow. Recharacterize the chunk header states such that they are bound by the check for HTTP_MAX_HEADER_SIZE.	14 years ago
Sean Cunningham	81ca70aec1	Avoid chunk trailer overflow. Check for overflow during chunk trailer by removing unnecessary check in macro PARSING_HEADER. This will force the parser to abort if the chunk trailer contains more than HTTP_MAX_HEADER_SIZE of data.	14 years ago
Ryan Dahl	1c3624a963	Detect errors on EOF	14 years ago
Ryan Dahl	fcdbc2629f	Add hack for tmm1	14 years ago
Tim Becker	9656fd73de	moved unnecessary lookup	14 years ago
Nathan Rajlich	f825b52b7f	Added support for "SUBSCRIBE" and "UNSUBSCRIBE" request methods.	14 years ago
Nathan Rajlich	d56a0700d0	Add support for "M-SEARCH" and "NOTIFY" request methods. Allow a request path of "*" (for SSDP requests).	14 years ago
Nathan Rajlich	84578ae7a8	Set http_major when a request omits the HTTP version I.E. "GET /" in telnet	14 years ago
Ryan Dahl	37e9009369	Digits in hostname on CONNECT req allowed	14 years ago
Cliff Frey	90320fde7a	Remove acceptable_header array This was not necessary, as it was just being used as a downcase function.	14 years ago
Ryan Dahl	51de89f8b0	Accept tokens + SP for header fields	14 years ago
Ewen Cheslack-Postava	24be793f64	Provide typedefs instead of using stdint.h on Windows.	14 years ago
Nathan Rajlich	a66c61c190	Allow whitespace in the 'Content-Length' header.	14 years ago
Cliff Frey	459507f534	avoid assertion failure in error case Without this change, it is possible to get an assertion to fail by continuing to call http_parser_execute after it has returned an error. Specifically, the parser could be called with parser->state == s_chunk_size_almost_done and parser->flags & F_CHUNKED set. Then, F_CHUNKED could have been cleared, and an error could be hit. In this case, the parser would have returned with F_CHUNKED clear, but parser->state == s_chunk_size_almost_done, resulting in an assertion failure on the next call. There are alternate solutions possible, including just saving all of the fields (state included) on error. I didn't add a test case because this is a bit annoying to test, but I can add one if necessary.	14 years ago
Ben Noordhuis	cbb194ea8c	Replace C++ style comments with C comments so it compiles with `gcc -ansi -Wall`	14 years ago
Cliff Frey	ca2514dd3a	Array type cleanups. Also save space acceptable_header[x] is always assigned to a variable of type char, so the 'unsigned' is unnecessary. The other arrays can be of type int8_t/uint8_t to save space.	14 years ago
Cliff Frey	423c90d9fe	fixes for architectures with signed char default This could have resulted in memory before the normal_url_char array being read on architectures with signed char default.	14 years ago
Ryan Dahl	6f12467a8a	Use lookup tables of my own.	15 years ago
Jeff Terrace	d0dfc98773	Initialize method member to avoid falsely upgrading connections. Fixed Issue #7	15 years ago
Ryan Dahl	a59ba4d866	Support long messages	15 years ago
Ryan Dahl	120f0f6e09	Allow spaces in header fields	15 years ago
Ryan Dahl	5f27ea8179	Fix long line	15 years ago

216 Commits (master)