You run the Information Technology (IT) department of a major firm that has thousands of employees. You want to collect real-time application monitoring data that includes application process logs from the computers of each of these employees.
The analysis of this data can be useful in many ways. Information about application usage helps you to manage IT resources optimally, reduce spending, and increase employee productivity by resolving issues. The application monitoring data is published by HTTP clients. You want to collect this data from the HTTP clients in real time and write it to HDFS for further processing.
You perform the following tasks:
In the Administrator tool, create the Edge Data Streaming Service.
Create a data flow with an HTTP source service and an HDFS target service.
Add an Insert String transformation that appends the IP address of the machine.
Deploy the data flow. The HTTP source service receives data from an HTTP client that sends data through HTTP POST requests and sends the data through a data connection. The HDFS target service receives this data through the data connection.