HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

Programming

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

procodes 2020. 8. 4. 19:39

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

내 질문 제목의 키워드에 대한 SO 및 웹의 많은 게시물을 읽었으며 많은 내용을 배웠습니다. 내가 읽은 질문 중 일부는 특정 구현 문제와 관련이 있으며 다른 질문은 일반적인 개념에 중점을 둡니다. 기술 X가 기술 Y 등에서 발명 된 이유와 모든 개념을 이해했는지 확인하고 싶습니다. 그래서 여기에 간다 :

Http Polling : 기본적으로 XmlHttpRequest를 사용하는 AJAX

Http Long Polling : AJAX이지만 서버에 업데이트가없는 한 서버는 응답을 유지합니다. 서버에 업데이트가있는 즉시 서버가이를 보내면 클라이언트가 다른 요청을 보낼 수 있습니다. 단점은 추가 오버 헤드를 유발하는 앞뒤로 전송해야하는 추가 헤더 데이터입니다.

Http Streaming : 긴 폴링 과 유사하지만 서버는 "Transfer Encoding : chunked"라는 헤더로 응답하므로 서버가 데이터를 보낼 때마다 새 요청을 시작할 필요가 없으므로 추가 헤더 오버 헤드를 절약 할 수 있습니다. 여기서 단점은 서버가 보낸 여러 청크를 구별하기 위해 데이터의 구조를 "이해하고"이해해야한다는 것입니다.

Java Applet, Flash, Silverlight : tcp / ip를 통해 소켓 서버에 연결하는 기능을 제공하지만 플러그인이기 때문에 개발자는 이에 의존하고 싶지 않습니다.

WebSockets : 위의 메소드의 단점을 다음과 같은 방식으로 해결하려고 시도하는 새로운 API입니다.

Java Applets, Flash 또는 Silverlight와 같은 플러그인에 비해 WebSockets의 유일한 장점은 WebSockets가 기본적으로 브라우저에 내장되어 있으며 플러그인에 의존하지 않는다는 것입니다.
http 스트리밍에 비해 WebSockets의 유일한 장점은 수신 된 데이터를 "이해"하고 파싱하려고 할 필요가 없다는 것입니다.
Long Polling에 비해 WebSockets의 유일한 장점은 추가 헤더 크기를 제거하고 요청을 위해 소켓 연결을 열고 닫을 수 있다는 것입니다.

누락 된 다른 중요한 차이점이 있습니까? SO에 이미있는 많은 질문을 하나의 질문으로 다시 요청하거나 결합하는 경우 미안하지만 SO와 웹에있는 모든 정보에서 이러한 개념과 관련하여 완벽하게 이해하고 싶습니다.

감사!

식별 한 것보다 더 많은 차이점이 있습니다.

이중 / 방향 :

단방향 : HTTP 폴링, 긴 폴링, 스트리밍.
양방향 : WebSockets, 플러그인 네트워킹

지연 시간을 늘리려면 (대략) :

웹 소켓
플러그인 네트워킹
HTTP 스트리밍
HTTP 긴 설문 조사
HTTP 폴링

CORS (교차 출처 지원) :

Web 소켓 : 예
플러그인 네트워킹 : 정책 요청을 통한 플래시 (다른 사람들에 대해서는 확실하지 않음)
HTTP * (일부 최근 지원)

기본 이진 데이터 (유형 배열, 얼룩) :

Web 소켓 : 예
플러그인 네트워킹 : Flash가 아님 (ExternalInterface에서 URL 인코딩 필요)
HTTP * : 이진 형식 지원을 활성화하기위한 최근 제안

효율성 감소의 대역폭 :

플러그인 네트워킹 : 초기 정책 요청을 제외하고 플래시 소켓은 원시입니다
WebSockets : 연결 설정 핸드 셰이크 및 프레임 당 몇 바이트
HTTP 스트리밍 (서버 연결 재사용)
HTTP 폴링 : 모든 메시지에 대한 연결
HTTP 폴 : 모든 메시지에 대한 연결 + 데이터 메시지 없음

휴대 기기 지원 :

WebSocket : iOS 4.2 이상 Flash 에뮬레이션을 통해 또는 Android 용 Firefox 또는 Android 용 Chrome을 사용하는 일부 Android 는 기본 WebSocket 지원을 제공합니다.
플러그인 네트워킹 : 일부 안드로이드. iOS가 아님
HTTP * : 대부분 예

자바 스크립트 사용의 복잡성 (가장 단순한 것에서 가장 복잡한 것까지) 분명히 복잡한 측정은 다소 주관적입니다.

웹 소켓
HTTP 폴
플러그인 네트워킹
HTTP 긴 폴링, 스트리밍

또한 Server-Sent Events 라는 HTTP 스트리밍 표준화를위한 W3C 제안이 있습니다 . 현재 상당히 초기 단계이며 WebSocket과 비슷한 단순성을 갖춘 표준 Javascript API를 제공하도록 설계되었습니다.

많은 근거를 다루는 다른 사람들의 훌륭한 답변. 약간의 추가 사항이 있습니다.

The only advantage of WebSockets over plugins like Java Applets, Flash or Silverlight is that WebSockets are natively built into browsers and does not rely on plugins.

If by this you mean that you can use Java Applets, Flash, or Silverlight to establish a socket connection, then yes, that is possible. However you don't see that deployed in the real world too often because of the restrictions.

For example, intermediaries can and do shutdown that traffic. The WebSocket standard was designed to be compatible with existing HTTP infrastructure and so is far less prone to being interfered with by intermediaries like firewalls and proxies.

Moreover, WebSocket can use port 80 and 443 without requiring dedicated ports, again thanks to the protocol design to be as compatible as possible with existing HTTP infrastructure.

Those socket alternatives (Java, Flash, and Silverlight) are difficult to use securely in a cross-origin architecture. Thus people often attempting to use them cross-origin will tolerate the insecurities rather than go to the effort of doing it securely.

They can also require additional "non-standard" ports to be opened (something administrators are loathe to do) or policy files that need to be managed.

In short, using Java, Flash, or Silverlight for socket connectivity is problematic enough that you don't see it deployed in serious architectures too often. Flash and Java have had this capability for probably at least 10 years, and yet it's not prevalent.

The WebSocket standard was able to start with a fresh approach, bearing those restrictions in mind, and hopefully having learned some lessons from them.

Some WebSocket implementations use Flash (or possibly Silverlight and/or Java) as their fallback when WebSocket connectivity cannot be established (such as when running in an old browser or when an intermediary interferes).

While some kind of fallback strategy for those situations is smart, even necessary, most of those that use Flash et al will suffer from the drawbacks described above. It doesn't have to be that way -- there are workarounds to achieve secure cross-origin capable connections using Flash, Silverlight, etc -- but most implementations won't do that because it's not easy.

For example, if you rely on WebSocket for a cross-origin connection, that will work fine. But if you then run in an old browser or a firewall/proxy interfered and rely on Flash, say, as your fallback, you will find it difficult to do that same cross-origin connection. Unless you don't care about security, of course.

That means it's difficult have a single unified architecture that works for native and non-native connections, unless you're prepared to put in quite a bit of work or go with a framework that has done it well. In an ideal architecture, you wouldn't notice if the connections were native or not; your security settings would work in both cases; your clustering settings would still work; your capacity planning would still hold; and so on.

The only advantage of WebSockets over http streaming is that you don't have to make an effort to "understand" and parse the data received.

It's not as simple as opening up an HTTP stream and sitting back as your data flows for minutes, hours, or longer. Different clients behave differently and you have to manage that. For example some clients will buffer up the data and not release it to the application until some threshold is met. Even worse, some won't pass the data to the application until the connection is closed.

So if you're sending multiple messages down to the client, it's possible that the client application won't receive the data until 50 messages worth of data has been received, for example. That's not too real-time.

While HTTP streaming can be a viable alternative when WebSocket is not available, it is not a panacea. It needs a good understanding to work in a robust way out in the badlands of the Web in real-world conditions.

Are there any other significant differences that I am missing?

There is one other thing that noone has mentioned yet, so I'll bring it up.

The WebSocket protocol was designed to a be a transport layer for higher-level protocols. While you can send JSON messages or what-not directly over a WebSocket connection, it can also carry standard or custom protocols.

For example, you could do AMQP or XMPP over WebSocket, as people have already done. So a client could receive messages from an AMQP broker as if it were connected directly to the broker itself (and in some cases it is).

Or if you have an existing server with some custom protocol, you can transport that over WebSocket, thus extending that back-end server to the Web. Often an existing application that has been locked in the enterprise can broaden it's reach using WebSocket, without having to change any of the back-end infrastructure.

(Naturally, you'd want to be able to do all that securely so check with the vendor or WebSocket provider.)

Some people have referred to WebSocket as TCP for the Web. Because just like TCP transports higher-level protocols, so does WebSocket, but in a way that's compatible with Web infrastructure.

So while sending JSON (or whatever) messages directly over WebSocket is always possible, one should also consider existing protocols. Because for many things you want to do, there's probably a protocol that's already been thought of to do it.

I'm sorry if I am re-asking or combining many of the questions already on SO into a single question, but I just want to make perfect sense out of all the info that is out there on SO and the web regarding these concepts.

This was a great question, and the answers have all been very informative!

If I may ask one additional thing: I came across in an article somewhere that says that http streaming may also be cached by proxies while websockets are not. what does that mean?

(StackOverflow limits the size of comment responses, so I've had to answer here rather than inline.)

That's a good point. To understand this, think about a traditional HTTP scenario... Imagine a browser opened a web page, so it requests http://example.com, say. The server responds with HTTP that contains the HTML for the page. Then the browser sees that there are resources in the page, so it starts requesting the CSS files, JavaScript files, and images of course. They are all static files that will be the same for all clients requesting them.

Some proxies will cache static resources so that subsequent requests from other clients can get those static resources from the proxy, rather than having to go all the way back to the central web server to get them. This is caching, and it's a great strategy to offload requests and processing from your central services.

So client #1 requests http://example.com/images/logo.gif, say. That request goes through the proxy all the way to the central web server, which serves up logo.gif. As logo.gif passes through the proxy, the proxy will save that image and associate it with the address http://example.com/images/logo.gif.

When client #2 comes along and also requests http://example.com/images/logo.gif, the proxy can return the image and no communication is required back to the web server in the center. This gives a faster response to the end user, which is always great, but it also means that there is less load on the center. That can translate to reduced hardware costs, reduced networking costs, etc. So it's a good thing.

The problem arises when the logo.gif is updated on the web server. The proxy will continue to serve the old image unaware that there is a new image. This leads to a whole thing around expiry so that the proxy will only cache the image for a short time before it "expires" and the next request goes through the proxy to the web server, which then refreshes the proxy's cache. There are also more advanced solutions where a central server can push out to known caches, and so on, and things can get pretty sophisticated.

How does this tie in to your question?

You asked about HTTP streaming where the server is streaming HTTP to a client. But streaming HTTP is just like regular HTTP except you don't stop sending data. If a web server serves an image, it sends HTTP to the client that eventually ends: you've sent the whole image. And if you want to send data, it's exactly the same, but the server just sends for a really long time (like it's a massively gigantic image, say) or even never finishes.

From the proxy's point of view, it cannot distinguish between HTTP for a static resource like an image, or data from HTTP streaming. In both of those cases, the client made a request of the server. The proxy remembered that request and also the response. The next time that request comes in, the proxy serves up the same response.

So if your client made a request for stock prices, say, and got a response, then the next client may make the same request and get the cached data. Probably not what you want! If you request stock prices you want the latest data, right?

So it's a problem.

There are tricks and workarounds to handle problems like that, it is true. Obviously you can get HTTP streaming to work since it's it's in use today. It's all transparent to the end user, but the people who develop and maintain those architectures have to jump through hoops and pay a price. It results in over-complicated architectures, which means more maintenance, more hardware, more complexity, more cost. It also means developers often have to care about something they shouldn't have to when they should just be focussing on the application, GUI, and business logic -- they shouldn't have to be concerned about the underlying communication.

HTTP limits the number of connections a client can have with a server to 2 (although this can be mitigated by using subdomains) and IE has been known to enforce this eagerly. Firefox and Chrome allow more (although I can't remember of the top of my head exactly how many). This might not seem like a huge issue but if you are using 1 connection constantly for real-time updates, all other requests have to bottleneck through the other HTTP connection. And there is the matter of having more open connections from clients puts more load on the server.

WebSockets are a TCP-based protocol and as such don't suffer from this HTTP-level connection limit (but, of course, browser support is not uniform).

참고URL : https://stackoverflow.com/questions/12555043/my-understanding-of-http-polling-long-polling-http-streaming-and-websockets

'Programming' 카테고리의 다른 글

pip가 버전 10에서 버전 18로 업그레이드 한 이유는 무엇입니까? (0)	2020.08.04
동일한 솔루션 / 프로젝트에서 Visual Studio를 사용하여 32 비트 및 64 비트 모두 대상 (0)	2020.08.04
인공 신경 네트워크에 대해 배우기위한 좋은 자료는 무엇입니까? (0)	2020.08.04
LINQ to Entities가 방법을 인식하지 못합니다 (0)	2020.08.04
풍부한 Word 문서를 만들 수있는 Java API가 있습니까? (0)	2020.08.04

현재글HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

procodes

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

'Programming' 카테고리의 다른 글

'Programming'의 다른글

티스토리툴바

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

HTTP 폴링, 롱 폴링, HTTP 스트리밍 및 웹 소켓에 대한 이해

'Programming' 카테고리의 다른 글

'Programming'의 다른글

관련글

티스토리툴바