Matches in SemOpenAlex for { <https://semopenalex.org/work/W1923150> ?p ?o ?g. }
- W1923150 abstract "Data integration used to be offline, but real-time data integration has become more and more important. Research into stream databases can be naturally applied to near-real-time data integration. Several important problems in near-real-time data integration can be naturally expressed as joins. Many stream joins assume all join inputs to be streams. Recently, interest has been growing in joins with heterogeneous input, in particular joins between streams and disk-based input. MESHJOIN is a well-known algorithm published in this area. The algorithm was designed particularly for application scenarios where memory resources are limited. However, the algorithm suffers from some limitations. Briefly, the memory distribution among the join components and the strategy used for accessing the disk-based data are suboptimal. This thesis provides an independent analysis of the MESHJOIN algorithm. The focus of analysis is on equijoins as one of the most important special cases of joins. It has been shown that if a realistic distribution is assumed on stream data, such as a Zipfian distribution, MESHJOIN performs suboptimally. A set of algorithms have been developed that address the problems in MESHJOIN and they perform better than MESHJOIN in defined settings. In the end, three robust algorithms have been developed for both sorted and unsorted disk-based data. For these algorithms cost models have been developed for tuning the algorithms and validation of our implementation. An experimental study has been carried out for comparing these algorithms empirically. For that purpose a synthetic workload generator has been designed and developed. With the synthetic datasets, measurements have been taken in experiments that validate the cost models of the algorithms. The implemented algorithms are made available publicly as open source for independent analysis. In the future this research can be extended in two directions. One is to improve the join operators further. The other is to apply the join operators in emerging application scenarios." @default.
- W1923150 created "2016-06-24" @default.
- W1923150 creator A5061284177 @default.
- W1923150 date "2012-01-01" @default.
- W1923150 modified "2023-09-27" @default.
- W1923150 title "Efficient joins to process stream data" @default.
- W1923150 cites W100509257 @default.
- W1923150 cites W111885355 @default.
- W1923150 cites W135511438 @default.
- W1923150 cites W1484813121 @default.
- W1923150 cites W1493149182 @default.
- W1923150 cites W1495986321 @default.
- W1923150 cites W1500547548 @default.
- W1923150 cites W1504439773 @default.
- W1923150 cites W150801922 @default.
- W1923150 cites W1515038756 @default.
- W1923150 cites W1520103883 @default.
- W1923150 cites W1551232187 @default.
- W1923150 cites W1560728783 @default.
- W1923150 cites W1572120187 @default.
- W1923150 cites W1648759563 @default.
- W1923150 cites W176053229 @default.
- W1923150 cites W1831808549 @default.
- W1923150 cites W1848127982 @default.
- W1923150 cites W1943411325 @default.
- W1923150 cites W1972833205 @default.
- W1923150 cites W1991962718 @default.
- W1923150 cites W1997802075 @default.
- W1923150 cites W2001474264 @default.
- W1923150 cites W2002834828 @default.
- W1923150 cites W2004943386 @default.
- W1923150 cites W2008492035 @default.
- W1923150 cites W2010801412 @default.
- W1923150 cites W2012802704 @default.
- W1923150 cites W2020147322 @default.
- W1923150 cites W2020601345 @default.
- W1923150 cites W2020754916 @default.
- W1923150 cites W2023027635 @default.
- W1923150 cites W2023997631 @default.
- W1923150 cites W2026302857 @default.
- W1923150 cites W2028859573 @default.
- W1923150 cites W2053287658 @default.
- W1923150 cites W2056259203 @default.
- W1923150 cites W205648239 @default.
- W1923150 cites W2056617836 @default.
- W1923150 cites W2057438377 @default.
- W1923150 cites W2060161902 @default.
- W1923150 cites W2063936751 @default.
- W1923150 cites W2064384103 @default.
- W1923150 cites W2080555007 @default.
- W1923150 cites W2082004608 @default.
- W1923150 cites W2085942561 @default.
- W1923150 cites W2094438648 @default.
- W1923150 cites W2096022210 @default.
- W1923150 cites W2099395665 @default.
- W1923150 cites W2103106131 @default.
- W1923150 cites W2105175890 @default.
- W1923150 cites W2105818147 @default.
- W1923150 cites W2106163100 @default.
- W1923150 cites W2109149785 @default.
- W1923150 cites W2110762996 @default.
- W1923150 cites W2112215401 @default.
- W1923150 cites W2113487460 @default.
- W1923150 cites W2119584748 @default.
- W1923150 cites W2120587290 @default.
- W1923150 cites W2120828587 @default.
- W1923150 cites W2122290246 @default.
- W1923150 cites W2122822096 @default.
- W1923150 cites W2124303732 @default.
- W1923150 cites W2126666814 @default.
- W1923150 cites W2128362607 @default.
- W1923150 cites W2131824593 @default.
- W1923150 cites W2132244350 @default.
- W1923150 cites W2132510297 @default.
- W1923150 cites W2132520482 @default.
- W1923150 cites W2132520571 @default.
- W1923150 cites W2132851185 @default.
- W1923150 cites W2146884585 @default.
- W1923150 cites W21469624 @default.
- W1923150 cites W2149576945 @default.
- W1923150 cites W2150606131 @default.
- W1923150 cites W2152198818 @default.
- W1923150 cites W2153485419 @default.
- W1923150 cites W2154616485 @default.
- W1923150 cites W2157469509 @default.
- W1923150 cites W2158238403 @default.
- W1923150 cites W2160034749 @default.
- W1923150 cites W2161578281 @default.
- W1923150 cites W2162228985 @default.
- W1923150 cites W2162654304 @default.
- W1923150 cites W2164149526 @default.
- W1923150 cites W2165231903 @default.
- W1923150 cites W2166604013 @default.
- W1923150 cites W2167276970 @default.
- W1923150 cites W2169486917 @default.
- W1923150 cites W2169593628 @default.
- W1923150 cites W2170691805 @default.
- W1923150 cites W2170699196 @default.
- W1923150 cites W2201552871 @default.
- W1923150 cites W2296677182 @default.