PDPS-Bench TPCTC2024@VLDB
Published:
โ ๏ธ ๐ข๐ป๐ฒ ๐ผ๐ณ ๐๐ต๐ฒ ๐ธ๐ฒ๐ ๐ฐ๐ต๐ฎ๐น๐น๐ฒ๐ป๐ด๐ฒ๐ ๐ถ๐ป ๐บ๐ฎ๐ฐ๐ต๐ถ๐ป๐ฒ ๐น๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด ๐ฒ๐๐ฝ๐ฒ๐ฐ๐ถ๐ฎ๐น๐น๐ ๐ณ๐ผ๐ฟ ๐ผ๐ฝ๐๐ถ๐บ๐ถ๐๐ฎ๐๐ถ๐ผ๐ป ๐ฝ๐ฟ๐ผ๐ฏ๐น๐ฒ๐บ๐, ๐ถ๐ ๐ด๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐ป๐ด ๐ต๐ถ๐ด๐ต-๐พ๐๐ฎ๐น๐ถ๐๐ ๐ฑ๐ฎ๐๐ฎ. This data directly impacts the accuracy of models. In scenarios like parallel and distributed stream processing, the need for high-quality training and evaluation data becomes even more crucial.
๐ก To tackle this, we introduced ๐ฃ๐๐ฆ๐ฃ-๐๐ฒ๐ป๐ฐ๐ต, a performance benchmarking system designed to evaluate parallel and distributed stream processing (DSP) in heterogeneous environments. ๐ฃ๐๐ฆ๐ฃ-๐๐ฒ๐ป๐ฐ๐ต ๐ฎ๐ถ๐บ๐ ๐๐ผ ๐ฝ๐ฟ๐ผ๐๐ถ๐ฑ๐ฒ ๐ต๐ถ๐ด๐ต-๐พ๐๐ฎ๐น๐ถ๐๐ ๐ฝ๐ฒ๐ฟ๐ณ๐ผ๐ฟ๐บ๐ฎ๐ป๐ฐ๐ฒ ๐ฏ๐ฒ๐ป๐ฐ๐ต๐บ๐ฎ๐ฟ๐ธ ๐ฑ๐ฎ๐๐ฎ๐๐ฒ๐๐ to ensure that DSP optimization mechanisms using machine learning can be effectively trained and fine-tuned in diverse and dynamic environments.
โญ I had the privilege of presenting our paper ๐ฃ๐๐ฆ๐ฃ-๐๐ฒ๐ป๐ฐ๐ต at ๐ง๐ฃ๐๐ง๐ ๐ฎ๐ฌ๐ฎ๐ฐ, held as part of the ๐ฉ๐๐๐ ๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ in China and sharing our work with the global data management community.
๐ See the full update on LinkedIn
๐ Our paper: