Search code examples
javaxmlscalajavax

how to create a set by parsing xml in which every children has his parent


I am using javax xml for parsing XML, what I want is to include parent in all children.

<a>
   <b value1 = "b1", value2 = "b2">
       <c value1 = "c1", value2 = "c2" />
       <c value1 = "c3", value2 = "c4" />
   </b>
   <b value1 = "b3">
       <c value1="c5" />
       <c value1 ="c6" />
   </b>
</a>

I want result to be in form like:

Set((b1,c1,c2), (b1,c3,c4), (b3, c5, ""), (b3, c6, "")) 

Solution

  • Here is how I was able to achieve this using XML literals in Scala. It is not the most efficient approach, but it accomplishes the goal you specified:

    scala> val xml: scala.xml.Elem = <a>
      <b value1="b1" value2="b2">
        <c value1="c1" value2="c2"/>
        <c value1="c3" value2="c4"/>
      </b>
      <b value1="b3">
        <c value1="c5"/>
        <c value1="c6"/>
      </b>
    </a>
    
    scala> val bs = xml\ "b"
    bs: scala.xml.NodeSeq = <b value1="b1" value2="b2">
        <c value1="c1" value2="c2"/>
        <c value1="c3" value2="c4"/>
      </b><b value1="b3">
        <c value1="c5"/>
        <c value1="c6"/>
      </b>
    
    scala> val gatheringNodes = bs.map { b =>
      val cs = b\"c"
      b\"@value1" -> cs.map(c => (c\"@value1", c\"@value2"))
    }
    gatheringNodes: scala.collection.immutable.Seq[(scala.xml.NodeSeq, scala.collection.immutable.Seq[(scala.xml.NodeSeq, scala.xml.NodeSeq)])] = List((b1,List((c1,c2), (c3,c4))), (b3,List((c5,), (c6,))))
    
    scala> val finalOutput = gatheringNodes.flatMap { case(b, cs) =>
        cs.map { case(c1, c2) => (b, c1, c2)}
    }.toSet
    finalOutput: scala.collection.immutable.Set[(scala.xml.NodeSeq, scala.xml.NodeSeq, scala.xml.NodeSeq)] = Set((b1,c1,c2), (b1,c3,c4), (b3,c5,), (b3,c6,))