In connection-oriented communication, a data stream is the transmission of a sequence of digitally encoded signals to convey information.[1] Typically, the transmitted symbols are grouped into a series of packets.[2]

Data streaming has become ubiquitous. Anything transmitted over the Internet is transmitted as a data stream. Using a mobile phone to have a conversation transmits the sound as a data stream.


Formal definition

Formally, a data stream is any ordered pair (s, Δ) where:

  1. s is a sequence of tuples and
  2. Δ is a sequence of positive real time intervals.
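
The definition above pairs a sequence of tuples with a sequence of positive time intervals. The following is a minimal Python sketch of that pair for a finite prefix of a stream. It assumes one interval per tuple, read as the time elapsed before that tuple arrives. The names `DataStream` and `arrival_times` are illustrative and do not come from the definition.

```python
from dataclasses import dataclass
from itertools import accumulate
from typing import Any, List, Sequence, Tuple


@dataclass(frozen=True)
class DataStream:
    """An ordered pair: a sequence of tuples and a sequence of time intervals."""
    tuples: Sequence[Tuple[Any, ...]]
    intervals: Sequence[float]  # assumption: seconds elapsed before each tuple

    def __post_init__(self) -> None:
        # Assumption of this sketch: exactly one interval per tuple.
        if len(self.tuples) != len(self.intervals):
            raise ValueError("need exactly one interval per tuple")
        if any(dt <= 0 for dt in self.intervals):
            raise ValueError("intervals must be positive real numbers")

    def arrival_times(self) -> List[float]:
        """Cumulative sums of the intervals, i.e. when each tuple arrives."""
        return list(accumulate(self.intervals))


stream = DataStream(
    tuples=[("a", 1), ("b", 2), ("c", 3)],
    intervals=[0.5, 1.0, 0.25],
)
print(stream.arrival_times())  # [0.5, 1.5, 1.75]
```

Storing intervals rather than absolute timestamps mirrors the definition; absolute arrival times can be recovered by accumulation when needed.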


A data stream contains different sets of data, depending on the chosen data format.

  • Attributes – each attribute[3] of the data stream represents a certain type of data, e.g. segment / data point ID, timestamp, geodata.
  • Timestamp – an attribute that identifies when an event occurred.
  • Subject ID – an algorithmically encoded ID that has been extracted from a cookie.
  • Raw data – information that comes straight from the data provider without being processed by an algorithm or a human.
  • Processed data – data that has been prepared[4] (modified, validated or cleaned in some way) for use in future actions.
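
To make the attributes above concrete, here is a minimal sketch of one stream record and a preparation step that turns raw data into processed data. The field names (`subject_id`, `timestamp`, `payload`) and the validation and cleaning rules are hypothetical, chosen only for illustration; real data formats differ between providers.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass(frozen=True)
class Record:
    subject_id: str      # encoded ID, e.g. derived from a cookie (hypothetical field)
    timestamp: datetime  # when the event occurred
    payload: dict        # remaining attributes, e.g. data point ID or geodata


def process(raw: dict) -> Optional[Record]:
    """Validate and clean one raw record; return None if it is unusable."""
    subject_id = str(raw.get("subject_id", "")).strip()
    ts = raw.get("timestamp")
    if not subject_id or not isinstance(ts, (int, float)):
        return None  # validation: both fields are required in this sketch
    # Cleaning: drop empty attributes and the fields already extracted above.
    payload = {k: v for k, v in raw.items()
               if k not in ("subject_id", "timestamp") and v is not None}
    return Record(subject_id, datetime.fromtimestamp(ts, tz=timezone.utc), payload)


raw_records = [
    {"subject_id": " abc123 ", "timestamp": 1556064000, "data_point": "dp-7", "geo": None},
    {"subject_id": "", "timestamp": 1556064001},  # rejected: no subject ID
]
processed = [r for r in map(process, raw_records) if r is not None]
print(len(processed))  # 1
```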


There are various areas where data streams are used:

  • Fraud detection & scoring – raw data is used as source data for an anti-fraud algorithm (data analysis techniques for fraud detection). For example, timestamps, cookie occurrences or analysis of data points are used within the scoring system to detect fraud or to make sure that a message receiver is not a bot (so-called Non-Human Traffic[5]).
  • Artificial intelligence – raw data serves as a training set and a test set when building AI and machine learning algorithms.
  • Profiling and personalization – raw data is used to customize user profiles[6] and divide them into segments, e.g., by gender or location (based on data points).
  • Business intelligence – raw data is a source of information for BI systems, used for enriching user profiles with detailed information about them, e.g., purchase path or geodata. This information is used for business analysis and predictive research.
  • Targeting – data processed by data scientists improves online campaigns and is used to reach the target audience.[7]
  • CRM enrichment – raw data is integrated with a customer relationship management (CRM) system. CRM integration makes it possible to fill the gaps in users' profiles with demographic data, interests or buying intentions.


Core integrations with data streams are:

  • Data streams are integrated with systems such as a customer data platform (CDP), a customer relationship management (CRM) system or a data management platform (DMP) to enrich users' profiles with external data. External sources make it possible to expand the knowledge about existing users.[8]
  • Data streams are used to enrich business intelligence systems and make analysis more precise and conclusions more accurate.
  • In the case of content management system (CMS) integration, a data stream is used to identify users and personalize their visit, even if it is their first one. Through data analysis, the actual content of the website is adapted to the user.
  • Data streams are integrated with a demand-side platform (DSP) within the programmatic advertising ecosystem. Parties (e.g., advertisers) can exchange users' IDs and combine them with existing profiles.
  • Data streams are used to choose the relevant user segments (e.g., people interested in the automotive industry) and use them in an online campaign. Segments are enriched with more user characteristics from the data stream and then sent to the DSP.

Data sources visible

A data stream shows which device the user has used; this is visible in the user agent:

  • mobile – when a user browses with a mobile browser or the mobile version of an app; the device has a narrow screen resolution;
  • desktop – when a user uses a desktop browser or the desktop version of an app.
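
As a rough illustration of reading the device class from the user agent, the sketch below checks the User-Agent string for a few common substrings. This is a simplistic heuristic assumed for the example, not a method described by the source; the hint list is not exhaustive, and production systems usually rely on a maintained user-agent parsing library.

```python
# Assumed, non-exhaustive substrings that commonly appear in mobile User-Agents.
MOBILE_HINTS = ("Mobi", "Android", "iPhone", "iPad")


def device_class(user_agent: str) -> str:
    """Return 'mobile' or 'desktop' based on substrings of the User-Agent."""
    return "mobile" if any(hint in user_agent for hint in MOBILE_HINTS) else "desktop"


iphone_ua = ("Mozilla/5.0 (iPhone; CPU iPhone OS 16_0 like Mac OS X) "
             "AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.0 "
             "Mobile/15E148 Safari/604.1")
windows_ua = ("Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) "
              "Gecko/20100101 Firefox/115.0")

print(device_class(iphone_ua))   # mobile
print(device_class(windows_ua))  # desktop
```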

The following information is shared from the device used:


A data point is a tag that collects information about a certain action performed by a user on a website. Data points come in two types, the values of which are used to create appropriate audiences:

  • 'event', with information about occurrences of a specific event (e.g., a click on a link or the display of an ad)
  • 'attribute', with numerical or alphanumerical values.

Segment – a logical statement built on specific data points using AND, OR or NOT operators.[9]
Hybrid data – raw data from both the data point and segment data formats.[10]
URLs – a set of information about a particular URL that has been visited.
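
A segment, as a logical statement over data points, can be sketched as a composable predicate. In the minimal Python example below, a user is represented by the set of data point names recorded for them. The helper names (`has`, `and_`, `or_`, `not_`) and the data point names are hypothetical and serve only to show how AND, OR and NOT combine.

```python
from typing import Callable, Set

# A segment is modelled as a predicate over the data points seen for one user.
Segment = Callable[[Set[str]], bool]


def has(data_point: str) -> Segment:
    """Segment that matches users for whom the data point was recorded."""
    return lambda seen: data_point in seen


def and_(*parts: Segment) -> Segment:
    return lambda seen: all(part(seen) for part in parts)


def or_(*parts: Segment) -> Segment:
    return lambda seen: any(part(seen) for part in parts)


def not_(part: Segment) -> Segment:
    return lambda seen: not part(seen)


# (visited_car_review OR clicked_car_ad) AND NOT purchased_car
automotive_prospects = and_(
    or_(has("visited_car_review"), has("clicked_car_ad")),
    not_(has("purchased_car")),
)

print(automotive_prospects({"clicked_car_ad"}))                   # True
print(automotive_prospects({"clicked_car_ad", "purchased_car"}))  # False
print(automotive_prospects({"read_news"}))                        # False
```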


Information gathered from websites is based on user behavior. Data providers deliver both personal and non-personal information. There are two types of user data available in a data stream:

  • Personally identifiable information (PII) – information that identifies a person, either directly or in combination with data identification methods. Examples of PII are: insurance ID, email address, phone number, IP address, geolocation, biometric data.[11]
  • Non-personally identifiable information (non-PII) is information that can't be used to identify a person or to track a location. A cookie or a device ID is an example of non-PII.
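
The split between the two types of user data can be sketched as a simple field filter. The example below assumes a flat record whose keys are hypothetical names for the PII examples listed above; which fields actually count as personal data depends on the applicable regulation and is not decided by this sketch.

```python
from typing import Tuple

# Hypothetical key names for the PII examples given in the list above.
PII_FIELDS = {"insurance_id", "email", "phone", "ip_address", "geolocation", "biometric"}


def split_pii(record: dict) -> Tuple[dict, dict]:
    """Split a record into (PII fields, non-PII fields) by key name."""
    pii = {k: v for k, v in record.items() if k in PII_FIELDS}
    non_pii = {k: v for k, v in record.items() if k not in PII_FIELDS}
    return pii, non_pii


record = {"email": "user@example.com", "device_id": "d-42", "cookie": "c-9"}
pii, non_pii = split_pii(record)
print(pii)      # {'email': 'user@example.com'}
print(non_pii)  # {'device_id': 'd-42', 'cookie': 'c-9'}
```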

See Also

Streaming Algorithm


  1. ^ "Federal Standard 1037C data stream". Archived from the original on April 13, 2007. Retrieved April 4, 2007.
  2. ^ "Data Stream". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  3. ^ "Attribute". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  4. ^ "What constitutes data processing?". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  5. ^ "Non-Human Traffic [NHT]". June 7, 2017. Archived from the original on August 13, 2017. Retrieved April 24, 2019.
  6. ^ "Behavioral Profiling and Personalization: Customer Experience First". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  7. ^ "What is Targeting – Meaning". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  8. ^ "What is Data Stream and how to use it". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  9. ^ "The 6 types of user segmentation and what they mean for your product". June 12, 2018.
  10. ^ "What is hybrid data management". January 2, 2018. Archived from the original on April 24, 2019. Retrieved April 24, 2019.
  11. ^ "What is personally identifiable information (PII)? How to protect it under GDPR". Archived from the original on April 24, 2019. Retrieved April 24, 2019.
This page was last edited on 30 May 2024, at 07:17
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.