http-parser

Commit Graph

Author	SHA1	Message	Date
Bertrand Paquet	9dfaa67f2b	Check host on url with hostname:port	13 years ago
Martell Malone	9852e5d048	test: %zu to %lu for msvcrt fixes for Mingw64	13 years ago
Ben Noordhuis	b97fdb0513	Don't assert() on whitespace in URL. Be lenient about tabs and form feeds in non-strict mode.	13 years ago
Nathan Rajlich	a3373d7627	add support for "SEARCH" request methods	13 years ago
Nathan Rajlich	5a1e556239	test.c: fix off-by-one on the requests test cases	13 years ago
Ben Noordhuis	99c0850240	test: abort(), don't exit() Makes it easier to debug failing test cases: abort() dumps core and asserts in a debugger.	13 years ago
Ben Noordhuis	62110efe7a	Support PURGE request method. Fixes joyent/node#2775.	13 years ago
David Gwynne	662e523a92	fix non-CONNECT tests missing port/hostname bits set is_connect properly	13 years ago
David Gwynne	67568421e9	allow extra ? at the beginning of a query_string. fixes joyent/http-parser issue #25	13 years ago
David Gwynne	8da60bc423	implement parsing of v6 addresses and rejection of 0-length host and ports. the v6 parsing works by adding extra states for working with the [] notation for v6 addresses. hosts and ports cannot be 0-length because we url parsing from ending when we expect those fields to begin. http_parser_parse_url gets a free check for the correctness of CONNECT urls (they can only be host:port). this addresses the following issues: i was bored and had my head in this space.	13 years ago
David Gwynne	0499525110	Fix http_parser_parse_url for urls like "http://host/path ". Before this change it would include the last slash in the separator between the schema and host as part of the host. we cant use the trick used for skipping the separator before ports, query strings, and fragments because if it was a CONNECT style url string (host:port) it would skip the first character of the hostname. Work around this by introducing a few more states to represent these separators in a url differently to what theyre separating. this in turn lets us simplify the url parsing so can simply skip what it considers delimiters rather than having to special case certain types of url parts and skip their prefixes. Add tests for the http_parser_parse_url(). This compares the http_parser_url struct that http_parser_parse_url() produces against one that we expect from the test. If they differ then http_parser_parse_url() misbehaved.	13 years ago
Ben Noordhuis	c3153bd1a9	Eat CRLF between requests, even on connection:close. Fixes #47.	13 years ago
Ben Noordhuis	f668e72380	Make content_length unsigned, add overflow checks.	13 years ago
Ivo Raisr	2a2f99f9cd	http_parser_init does not clear status_code	13 years ago
Peter Griess	eb04bbe1fa	Merge pull request #73 from pgriess/http-10-message-length Get HTTP/1.1 message length logic working for HTTP/1.0	13 years ago
Peter Griess	d0bb867d1b	Implement http_parser_pause(). Summary: - Add http_parser_pause() API. A callback may invoke this at any time. This will cause http_parser_parse() to return indicating that it parsed less than the number of requested bytes and set an error to HBE_PAUSED. A paused parser with fail with HBE_PAUSED until it is un-paused with http_parser_pause(). - Stop using 'state', 'header_state', 'index', and 'nread' shadow variables and then updating their http_parser fields when we're done. Instead, update the live values as we go. This will make it possible to return from anywhere in the parser (say, due to EPAUSED) and have valid/expected state. - Update state before making callbacks so that if the want to pause, we'll know the correct state already. - Make sure that every callback has a state that uniquely identifies the next step so that we can resume in the right place if we were suppoed to be paused. - Clean and re-factor up CALLBACK() macros. - Use CALLBACK() macros for (almost) all callbacks; on_headers_complete is still a special case. This includes on_body which we used to invoke manually with a long run of bytes. We now use a 'body' mark and hit its callback just like every other data callback. - Clean up (most) gotos and replace with real states. - Add some unit tests. Fixes #70	13 years ago
Peter Griess	b115d110a3	Don't wait for EOF on 0-length KA messages. - Break EOF handling out of http_should_keep_alive() into http_message_needs_eof(), which we now use when determining what to do with a message of unknown length. This prevents us from falling into the s_body_identity_eof state in the cases where we actually do know the length of the message (e.g. because the response status was 204).	13 years ago
Peter Griess	248fbc3ab4	Get HTTP/1.1 message length logic working for HTTP/1.0 - Port message length logic from #72 to HTTP/1.0. - Add a bunch of unit tests for handling 0-length messages.	13 years ago
Peter Griess	d7675cd9a6	Add http_parser_parse_url(). - Add an http_parser_parse_url() method to parse a URL into its constituent components. This uses the same underlying parser as http_parser_parse() and doesn't do any data copies. - Re-add the URL components in various test.c structures; validate them when parsing.	13 years ago
Peter Griess	48a4364fdd	Remove some chars from tokens[] per RFC. - Treat ' ' specially, as apparently IIS6.0 can send this in headers. Allow this character through if we're not in strict mode. - Move some test code around so that test indices don't break when HTTP_PARSER_STRICT changes. Fixes #13.	13 years ago
koichik	b47c44d7a6	Fix response body is not read With HTTP/1.1, if neither Content-Length nor Transfer-Encoding is present, section 4.4 of RFC 2616 suggests http-parser needs to read a response body until the connection is closed (except the response must not include a body) See also joyent/node#2457. Fixes #72	13 years ago
Felix Geisendörfer	2498961231	Accept HTTP/0.9 responses See joyent/node#1711	13 years ago
Peter Griess	53adfacad1	API CHANGE: Remove path, query, fragment CBs. - Get rid of support for these callbacks in http_parser_settings. - Retain state transitions between different URL portions in http_parser_execute() so that we're making the same correctness guarantees as before. - These are being removed because making multiple callbacks for the same byte makes it more difficult to pause the parser.	14 years ago
Jon Kolb	8153466643	Group POST refinements, test all request methods, make IS_ALPHA use LOWER internally	14 years ago
Peter Griess	9114e58a77	Facility to report detailed parsing errors. - Add http_errno enum w/ values for many parsing error conditions. Stash this in http_parser.state if the 0x80 bit is set. - Report line numbers on error generation if the (new) HTTP_PARSER_DEBUG cpp symbol is set. Increases http_parser struct size by 8 bytes in this case. - Add http_errno_*() methods to help turning errno values into human-readable messages.	14 years ago
Peter Griess	ddbbc07c10	Fix minor compilation bug introduced by merge.	14 years ago
Peter Griess	056bcd3672	Merge pull request #49 from pgriess/upgrade-off-by-one Fix off-by-one in handling upgrade bodies.	14 years ago
Peter Griess	d4ca280af5	Fix off-by-one in handling upgrade bodies. - When handling upgraded bodies, http_parser_execute() used to return one fewer bytes parsed than expected. This caused the final LF to be interpreted by the caller as part of the body. - Add a bunch of upgrade body unit tests.	14 years ago
Jon Kolb	a6934445e8	Allow uppercase chars in IS_ALPHANUM	14 years ago
Jon Kolb	dc314a3cb9	Return error when bad method starts with M or C	14 years ago
Sean Cunningham	b89f94414e	Support multi-line folding in header values. Normal value cb is called for subsequent lines. LWS is skipped. Note that \t whitespace character is now supported after header field name. RFC 2616, Section 2.2 "HTTP/1.1 header field values can be folded onto multiple lines if the continuation line begins with a space or horizontal tab. All linear white space, including folding, has the same semantics as SP. A recipient MAY replace any linear white space with a single SP before interpreting the field value or forwarding the message downstream."	14 years ago
Ryan Dahl	eee60127c0	Support PATCH method Requested in https://groups.google.com/forum/#!topic/nodejs-dev/iEOyiDkJRLs	14 years ago
Ryan Dahl	1efd9ac6a0	Number HOSTNAME_UNDERSCORE test	14 years ago
Peter Griess	3bd18a779e	IS_* macros for char classes. - Add IS_ALPHA(), IS_NUM(), IS_HOST_CHAR(), etc. macros for determining membership in a character class. HTTP_PARSER_STRICT causes some of these definitions to change. - Support '_' character in hostnames in non-strict mode. - Support leading digits in hostnames when the method is HTTP_CONNECT. - Don't re-define HTTP_PARSER_STRICT in http_parser.h if it's already defined. - Tweak Makefile to run non-strict-mode unit tests. Rearrange non-strict mode unit tests in test.c. - Add test_fast to .gitignore. Fixes #44	14 years ago
Ryan Dahl	2839784927	HTTP_STRICT ifdefs out behavior introduced in `50b9bec` Fixes #37.	14 years ago
Peter Griess	9639c7c21c	Support ?-terminated hostnames per RFC 2396.3.2. - Bust out of s_req_host and s_req_port on '?'. - Add tests for query string parsing. Fixes #38.	14 years ago
Peter Griess	50b9bec552	Allow octets > 127 in path components. - This is non-spec behavior, but it appears that most HTTP servers implicitly support non-ASCII characters when parsing path components. Extend http-parser to allow this. - Fill out slots [128, 256) in normal_url_char[] with 1 so that these high octets are accepted in path components. - Add unit test for paths that include such non-ASCII characters. Fixes #37.	14 years ago
Ryan Dahl	63daf22f2c	Update copyright headers	14 years ago
Ryan Dahl	1c3624a963	Detect errors on EOF	14 years ago
Nathan Rajlich	d56a0700d0	Add support for "M-SEARCH" and "NOTIFY" request methods. Allow a request path of "*" (for SSDP requests).	14 years ago
Ryan Dahl	03970a576d	Test that it can handle $ in header field	14 years ago
Nathan Rajlich	84578ae7a8	Set http_major when a request omits the HTTP version I.E. "GET /" in telnet	14 years ago
Ryan Dahl	b75cea580a	Test for dots at the begging on header fields	14 years ago
Ryan Dahl	04bc364610	Make sure it can handle spaces in content-length	14 years ago
Ryan Dahl	37e9009369	Digits in hostname on CONNECT req allowed	14 years ago
Ryan Dahl	fb875caa43	Add non-ascii in status line test from Ben Noordhuis	14 years ago
Ryan Dahl	51de89f8b0	Accept tokens + SP for header fields	14 years ago
Ryan Dahl	c1d48fdce8	Changes to compile with clang	15 years ago
Ryan Dahl	c7c242d55c	typo	15 years ago
Ryan Dahl	120f0f6e09	Allow spaces in header fields	15 years ago
Santiago Gala	0264a0aefc	Upgrade on CONNECT method	15 years ago
Cliff Frey	c83a018d05	test: always try and break every testcase up into two submessages This is just another way that would have caught the bug introduced in `076fa15132` and fixed by `03b8eaa5f8`.	15 years ago
Cliff Frey	5dd740304f	test.c: get it to work with valgrind by using realloc less For some reason valgrind would rapidly run out of memory on my machine without this.	15 years ago
Ryan Dahl	03b8eaa5f8	Reset url_mark on s_req_host add a new scan test. Report and fix by Master Becker.	15 years ago
Ryan Dahl	9dc258f9dd	Add subversion request methods REPORT, MKACTIVITY, CHECKOUT, MERGE	15 years ago
Cliff Frey	7a1103ae53	add tests of method strings Currently this test fails, because short method strings do not cause failures, even if they are unknown methods. However, long unknown method strings do cause errors.	15 years ago
Cliff Frey	634c3a6d26	test: fix compile warnings about printf + size_t	15 years ago
Cliff Frey	b413961182	test.c: add cases for header overflow conditions This currently fails, but the next commit fixes the issue.	15 years ago
Ryan Dahl	4cf39fd2fa	Support request URLs without schema Test case from Poul T Lomholt <pt@lomholt.com>	15 years ago
Ryan Dahl	cdda8b6a60	Support empty header values Test case by Pierre Ruyssen <pierre@ruyssen.fr>	15 years ago
Ryan Dahl	8beed7ef17	Fix whitespace	15 years ago
Cliff Frey	b8c3336f5d	add support for HTTP_BOTH This is good for analyzing raw streams of data when one is not sure which direction it will be in.	15 years ago
Cliff Frey	7239788205	pass pointer to settings structure rather than pass by value	15 years ago
Ryan Dahl	7cfa645fc7	Fix long chunked message bug The HTTP_MAX_HEADER_SIZE was being consulted at the end of the chunked message (when you look for trailing headers). http://github.com/ry/node/issues#issue/77	15 years ago
Ryan Dahl	88d11b394d	Support Upgrade header	15 years ago
Ryan Dahl	e09651c6bb	cross platfom size_t printing	15 years ago
Ryan Dahl	dbd2dad461	Introduce http_parser_settings	15 years ago
Ryan Dahl	8243fddd17	Fix c++ and mac compile errors	15 years ago
Cliff Frey	d5a900264f	Allow newlines before HTTP requests. I have seen cases where a browser will POST data, and then send an extra CRLF before issuing the next request.	15 years ago
Cliff Frey	f167565742	Allow '_' in header fields. Technically anything defined as a 'token' by http://www.w3.org/Protocols/rfc2616/rfc2616-sec2.html#sec2.2 should be allowed, which includes !#$%^&*+-.`~\| and probably others. However this is the only one that I have found in use.	15 years ago
Cliff Frey	6409a5bd17	Allow extra '?' in query strings, and add a test for it.	15 years ago
Ryan Dahl	9cbd66e49a	Support 'Proxy-Connection' header See http://www.http-stats.com/Proxy-Connection	15 years ago
Ryan Dahl	caef58793e	Update license for 2010	15 years ago
Ryan Dahl	1a677040c0	API: Define parser type in http_parser_init() That is, for a request parser do this: http_parser_init(my_parser, HTTP_REQUEST) for a response parser do this: http_parser_init(my_parser, HTTP_RESPONSE) Then http_parse_requests() and http_parse_responses() both turn into http_parer_execute().	15 years ago
Ryan Dahl	6108b765ce	Bugfix: sometimes servers send \n instead of \r\n	15 years ago
Ryan Dahl	1d9ebac036	Revert "Add method -> string lookup" This reverts commit `b795f94686`. (I don't like this feature, and I'm not using it.)	15 years ago
Ryan Dahl	79947a7334	Remove EOL whitespace	15 years ago
Ryan Dahl	b795f94686	Add method -> string lookup	15 years ago
Ryan Dahl	2fc9c8d801	Accidentally commented out a test	15 years ago
Ryan Dahl	51e9ff0314	Fix initialization bug. Heap allocate parser in tests, to get errors with valgrind.	15 years ago
Ryan Dahl	a8f7a3cd78	add message_complete_on_eof test	15 years ago
Ryan Dahl	bd291ab5d8	add license file	15 years ago
Ryan Dahl	4226a8f63b	add tests for should_keep_alive()	15 years ago
Ryan Dahl	5b00b6a64f	add http_should_keep_alive()	15 years ago
Ryan Dahl	5b37977e32	Don't put should_keep_alive messages in front of messages	15 years ago
Ryan Dahl	8f52d451a6	Add http version to tests	15 years ago
Ryan Dahl	ca1e011ab3	add response scan, fix persistent bug	15 years ago
Ryan Dahl	3ac0ebdee5	Passing tests	15 years ago
Ryan Dahl	0642366f0e	change around api	15 years ago
Ryan Dahl	3834853a8a	uri -> url	15 years ago
Ryan Dahl	6cefbc13af	all scans works	15 years ago
Ryan Dahl	b71a17ec85	better output for test_scan	15 years ago
Ryan Dahl	0b8a48049c	Handling chunked messages	15 years ago
Ryan Dahl	a0476a08a0	better output on errors in test program	15 years ago
Ryan Dahl	c5a92f792f	Now parsing some req headers	15 years ago
Ryan Dahl	433202d825	new version Trashing the old Ragel parser (which was based on Mongrel) because it's proving difficult to get the control I need in end-of-message cases. Replacing this with a hand written parser using a couple tricks borrowed from NGINX. The new parser will be much more work to write, but should prove faster and allow for better hacking.	15 years ago
Phoenix Sol	6bfd5bf76d	add ab to test	15 years ago
Ryan Dahl	2769741ba5	Fix LICENSE	15 years ago
Ryan Dahl	d827cb368c	Allow quotes in URI IE6 apparently sends such requests... Reported by Michael Carter.	15 years ago
Ryan	dbbc73c16f	API Change: Return void from http_parser_execute().	15 years ago

1 2 3 4

164 Commits (edeedb1b4d2f34e4c7d8045ac8b92adbc35e7ed7)