-
Notifications
You must be signed in to change notification settings - Fork 22
/
Copy pathAverage_Time_Of_Process_Per_Machine.sql
108 lines (95 loc) · 4.38 KB
/
Average_Time_Of_Process_Per_Machine.sql
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
-- Table: Activity
-- +----------------+---------+
-- | Column Name | Type |
-- +----------------+---------+
-- | machine_id | int |
-- | process_id | int |
-- | activity_type | enum |
-- | timestamp | float |
-- +----------------+---------+
-- The table shows the user activities for a factory website.
-- (machine_id, process_id, activity_type) is the primary key (combination of columns with unique values) of this table.
-- machine_id is the ID of a machine.
-- process_id is the ID of a process running on the machine with ID machine_id.
-- activity_type is an ENUM (category) of type ('start', 'end').
-- timestamp is a float representing the current time in seconds.
-- 'start' means the machine starts the process at the given timestamp and 'end' means the machine ends the process at the given timestamp.
-- The 'start' timestamp will always be before the 'end' timestamp for every (machine_id, process_id) pair.
-- There is a factory website that has several machines each running the same number of processes. Write a solution to find the average time each machine takes to complete a process.
-- The time to complete a process is the 'end' timestamp minus the 'start' timestamp. The average time is calculated by the total time to complete every process on the machine divided by the number of processes that were run.
-- The resulting table should have the machine_id along with the average time as processing_time, which should be rounded to 3 decimal places.
-- Return the result table in any order.
-- The result format is in the following example.
-- Example 1:
-- Input:
-- Activity table:
-- +------------+------------+---------------+-----------+
-- | machine_id | process_id | activity_type | timestamp |
-- +------------+------------+---------------+-----------+
-- | 0 | 0 | start | 0.712 |
-- | 0 | 0 | end | 1.520 |
-- | 0 | 1 | start | 3.140 |
-- | 0 | 1 | end | 4.120 |
-- | 1 | 0 | start | 0.550 |
-- | 1 | 0 | end | 1.550 |
-- | 1 | 1 | start | 0.430 |
-- | 1 | 1 | end | 1.420 |
-- | 2 | 0 | start | 4.100 |
-- | 2 | 0 | end | 4.512 |
-- | 2 | 1 | start | 2.500 |
-- | 2 | 1 | end | 5.000 |
-- +------------+------------+---------------+-----------+
-- Output:
-- +------------+-----------------+
-- | machine_id | processing_time |
-- +------------+-----------------+
-- | 0 | 0.894 |
-- | 1 | 0.995 |
-- | 2 | 1.456 |
-- +------------+-----------------+
-- Explanation:
-- There are 3 machines running 2 processes each.
-- Machine 0's average time is ((1.520 - 0.712) + (4.120 - 3.140)) / 2 = 0.894
-- Machine 1's average time is ((1.550 - 0.550) + (1.420 - 0.430)) / 2 = 0.995
-- Machine 2's average time is ((4.512 - 4.100) + (5.000 - 2.500)) / 2 = 1.456
-- Write your PostgreSQL query statement below
-- Solution
select machine_id,
ROUND(CAST("processing_time" AS NUMERIC), 3) AS processing_time
from
(
select X.machine_id machine_id,
X.total_completion_time/Y.num_processes as processing_time
from
(
select transformed.machine_id as machine_id,
sum(transformed.process_completion_time) as total_completion_time
from
(
select A.machine_id,
A.process_id,
A.activity_type A_atype,
A.timestamp A_tstamp,
B.activity_type B_atype,
B.timestamp B_tstamp,
B.timestamp - A.timestamp as process_completion_time
from Activity A
join Activity B
on (A.machine_id = B.machine_id
and A.process_id = B.process_id)
where (A.timestamp < B.timestamp
and A.activity_type = 'start'
and B.activity_type = 'end')
) transformed
group by transformed.machine_id
)
X
join
(
select machine_id,
count(process_id) as num_processes
from (select distinct machine_id, process_id from Activity)
group by machine_id
)
Y on X.machine_id = Y.machine_id
)