[solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

time limit per test
5 seconds
memory limit per test
512 megabytes
input
standard input
output
standard output
 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

In computer networks, the Internet Protocol (IP) uses path routing. With path routing, the packet contains only the destination address; routers decide which "next hop" to forward each packet on to get it closer to its destination. Each router makes this decision locally, based only on the destination address and its local configuration. One way to store this configuration is to use routing tables, which means for each router, we store the "next hop" for every possible destination. However, as the scale of the network grows larger, the size of the router table grows as well. It is a big issue since routers only have limited computing resources. One way to address this issue is to divide the graph into several partitions.

For example, in the above figure, there are 12 routers in the network. Each edge has a certain weight. Each router has to store 12 table entries for all possible destinations in the network (including itself). We can merge four nodes in the graph into one large abstract node R2. When routers are merged, all existing edges connected to the elements of a large node are preserved. Then all routers outside R2 only need to store 9 table entries. Therefore, in this case, we can assign the same "next hop" for all destination routers in R2. The intuition behind this is that router R1 does not need to care how to get R24 from R21 if R1 wants to send a packet to R24R1 just needs to find a path to get its packet to an arbitrary router inside R2 and then let the routers inside R2 decide how to get R24. For routers inside R2, however, there are still 12 table entries, because they need to know the routes to each other.

We can further merge R31,R32,R33,R34 into a single node R3. Then for R1, we need to store only 6 entries in its routing table. For routers in R3 or R2, it only needs to store 9 entries. Your task is to find an optimal partition of the initial graph to minimize the maximum size of the routing table for all routers in the network.

A computer network under consideration is an undirected connected graph G=(V,E), where |V|=N (2N105)|E|=M (N1M2×105)V={1,2,,N}E={e=(u,v,w)There is an edge between nodes u and v with weight w}.

A partition P of the network G is a set of non-empty subsets of V such that every node vV belongs to exactly one of these subsets, and the induced sub-graph for each subset in P is connected. Here, one subset corresponds to one abstract node mentioned above.

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

Formally, P is a valid partition if and only if P is a family of sets that satisfies:

  • P
  • APA=V
  • A,BP:ABAB=
  • AP, the induced subgraph G[A] is connected

The routing table size of any node v for the corresponding partition P is defined by the following equation:

RTsize(v,P)=|P|+|Q|1,
where |P| is the number of subsets in PQ is the subset in P such that vQ|Q| is the size of Q.

The "next hop" is defined by the shortest path tree. For each node pair (u,v),uV,vV, the "next hop" is a node zV, such that there is an edge eE connected from u to z directly, and dist(u,v)=w(e)+dist(z,v), here dist(u,v) denotes the length of the shortest path from u to v, and w(e) denotes the weight of the edge connecting u and z. The valid "next hop" may not be unique. After the graph partitioning, the "next hop" for nodes in the same subset AP is calculated via the shortest path in the induced sub-graph G[A] using the same method above. We compute the "next hop" for two nodes from different subsets of P by calculating the shortest path in the "compressed" graph (we compress all nodes in the same subset into a single node). The new graph only contains edges between nodes from different subsets. And then find the shortest path from the subset containing u to the subset containing v. Suppose the first edge in this path is between (x,y). From the definition, xy are in different subsets of P, and u is in the same subset of x. If x=u, then the "next hop" of (u,v) is y, otherwise the "next hop" of (u,v) is the "next hop" of (u,x).

The path from u to v may differ from the path in the graph before partition.

For example:

Originally, the packet transmission path from 1 to 2 is the shortest in the whole graph, so it is 132, and the total length is 3.

If our partition P={{1,2},{3},{4}}, then the packet transmission path from 1 to 2 is the shortest path in the subgraph, so it is 12. The total length is 10, which gets much greater after partition.

Define dist(u,v) as the length of the shortest path from u to v in graph G.

Let cost(u,v,P) be the length of the optimal transmission path after dividing graph G by partition P. The way to determine cost(u,v,P) is specified later.

For node u, we define B(u,P)=Q, where Q is the unique subset in P such that uQ.

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

Define two new edge sets:

EdgeZero(P)={(u,v,w)(u,v,w)EB(u,P)B(v,P)}{(u,v,0)(u,v,w)EB(u,P)=B(v,P)}

EdgeInf(P)={(u,v,)(u,v,w)EB(u,P)B(v,P)}{(u,v,w)(u,v,w)EB(u,P)=B(v,P)}

Define two undirected weighted graphs: GraphZero(P)=(V,EdgeZero(P))GraphInf(P)=(V,EdgeInf(P)).

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)

In other words, GraphZero(P)=(V,EdgeZero(P)) is almost a copy of the original graph, except for all edges that have both endpoints in the same subset of P (their weights are set to 0). GraphInf(P)=(V,EdgeInf(P)) is also almost a copy of the original graph, except for all edges that connect different subsets of P (their weights are set to infinity).

Let disZero(u,v,P) be the length of the shortest path from u to v in GraphZero(P).

Let disInf(u,v,P) be the length of the shortest path from u to v in GraphInf(P).

For nodes u,v: (B(u,P)B(v,P)), we define that middle(u,v,P) be the edge e=(x,y,w)EdgeZero(P) with the smallest edge index which makes the following equation holds:

disZero(u,v,P)=disZero(u,x,P)+w+disZero(y,v,P), w>0

We can prove such an edge always exists.

Then cost(u,v,P) can be calculated recursively:

  • For nodes u,v: (B(u,P)=B(v,P))cost(u,v,P)=disInf(u,v,P).
  • For nodes u,v: (B(u,P)B(v,P)), let e=(x,y,w)=middle(u,v,P), then cost(u,v,P)=cost(u,x,P)+w+cost(y,v,P).

Given a real number k0, and a vertex set SV, 1|S|min(50,N), find an optimal partition P, which maximizes Score2(P).

The score for SubTask2 for the ith graph is calculated by the formula:

Score2i=Score2(P=P(Gi))=max{0,N[maxvVRTsize(v,P)]kmaxuS,vV,uvcost(u,v,P)dist(u,v)dist(u,v)}.

The final score for SubTask2 is calculated according to the following formula:

Score2=0.7×Score2i

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)
Input

The first line contains two integers N and M (2N105;N1M2×105) — the number of nodes and edges, respectively.

Each of the next M lines contains three integers ui,vi and wi (0ui, viN1, uivi, 1wi105), denoting an edge between the node ui and vi with weight wiei=(ui,vi,wi) is the edge with index i in the edge set E. It is guaranteed that the input graph is connected and there are no multiple edges. That is, any pair (in any order) of nodes appear in this list no more than once.

The next line contains two numbers: one integer SetSize (1SetSizemin(50,N)), the size of node set S and one real number k, given with at most six digits after the decimal point (0k106).

Each of the next SetSize lines contains one integer NodeID (0NodeIDN1), the node in S.



 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)
Output

In the first line, output the number of subsets in your partition P.

The (i+1)-th line should begin with a single number cnti — the size of the i-th subset, followed by cnti numbers, denoting the node IDs in the i-th subset.



 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)
Scoring

During the coding phase, your solutions are tested against pre-tests only. The first 6 of them are open and available for download at the link problem-gp2-open-testset.zip in the "contest materials" section in a sidebar.

After the end of the coding phase, the last submission that scored a positive number of points on the pre-tests will be selected. It will be retested in the final tests. The rest of your submissions will be ignored. The final tests are secret and not available to the participants, they are mostly realistic. The result obtained on the final tests is your final official result for this problem.

  1. According to the formula, the solution with a larger score wins.
  2. If two solutions have the same score, the solution submitted first wins.
  3. For multiple test cases, the ranking is performed based on the following formula:
    Score2=0.7×Score2i.
  4. Final formula for the "ICPC 2022 Online Challenge powered by HUAWEI - Problem 1":
    FinalScore=Score1+Score2.

 [solution] GP2. Optimal Graph Partition for Fast Routing (GP2)
Examples
input
Copy
4 4
0 1 10
2 3 12
0 2 2
2 1 1
4 0
0
1
2
3
output
Copy
2
2 0 1
2 2 3
input
Copy
4 4
0 1 10
2 3 12
0 2 2
2 1 1
4 2.3
0
1
2
3
output
Copy
2
2 0 1
2 2 3

Comments

Popular posts from this blog

Whiteboarding Interviews

Version Control with Git: A Beginner’s Guide

Callback function in JavaScript