Guaranteeing SLAs for Jobs in Microservice Execution Frameworks EuroSys ’19, March 25–28, 2019, Dresden, Germany
References
[1]
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis,
Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving,
Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry
Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan,
Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016.
TensorFlow: A System for Large-Scale Machine Learning. In 12th
USENIX Symposium on Operating Systems Design and Implementation
(OSDI 16).
[2]
Paula Aguilera, Katherine Morrow, and Nam Sung Kim. 2014. Fair
share: Allocation of GPU resources for both performance and fairness.
In Proceedings of the IEEE 32nd International Conference on Computer
Design (ICCD 14).
[3]
Amazon. 2019. What is AWS Lambda?
https://docs.aws.amazon.com/lambda/latest/dg/welcome.html. (2019).
[4]
Amazon. 2019. What is AWS Step Functions?
http://docs.aws.amazon.com/step-functions/latest/dg/welcome.html. (2019).
[5]
Microsoft Azure. 2019. Azure Functions Serverless Architecture.
https://azure.microsoft.com/en-us/services/functions/. (2019).
[6]
Luiz André Barroso, Jimmy Clidaras, and Urs Hölzle. 2013. The data-
center as a computer: An introduction to the design of warehouse-scale
machines. Synthesis lectures on computer architecture 8, 3 (2013), 1–154.
[7]
Quan Chen, Hailong Yang, Minyi Guo, Ram Srivatsa Kannan, Jason
Mars, and Lingjia Tang. 2017. Prophet: Precise QoS Prediction on
Non-Preemptive Accelerators to Improve Utilization in Warehouse-
Scale Computers. In Proceedings of the Twenty-Second International
Conference on Architectural Support for Programming Languages and
Operating Systems (ASPLOS 17).
[8]
Quan Chen, Hailong Yang, Jason Mars, and Lingjia Tang. 2016. Bay-
max: QoS Awareness and Increased Utilization for Non-Preemptive
Accelerators in Warehouse Scale Computers. In Proceedings of the
Twenty-First International Conference on Architectural Support for Pro-
gramming Languages and Operating Systems (ASPLOS 16).
[9]
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray
Kavukcuoglu, and Pavel P. Kuksa. 2011. Natural Language Processing
(almost) from Scratch. CoRR abs/1103.0398 (2011).
http://arxiv.org/abs/1103.0398
[10]
Christina Delimitrou and Christos Kozyrakis. 2013. Paragon: QoS-
aware Scheduling for Heterogeneous Datacenters. In Proceedings of
the Eighteenth International Conference on Architectural Support for
Programming Languages and Operating Systems (ASPLOS 13).
[11]
Tarek Elgamal, Atul Sandur, Klara Nahrstedt, and Gul Agha. 2018.
Costless: Optimizing Cost of Serverless Computing through Function
Fusion and Placement. CoRR abs/1811.09721 (2018). arXiv:1811.09721
http://arxiv.org/abs/1811.09721
[12]
Sameh Elnikety, Erich Nahum, John Tracey, and Willy Zwaenepoel.
2004. A Method for Transparent Admission Control and Request Sched-
uling in e-Commerce Web Sites. In Proceedings of the 13th International
Conference on World Wide Web (WWW 04).
[13]
A. Gheith, R. Rajamony, P. Bohrer, K. Agarwal, M. Kistler, B. L. White
Eagle, C. A. Hambridge, J. B. Carter, and T. Kaplinger. 2016. IBM
Bluemix Mobile Cloud Services. IBM Journal of Research and Develop-
ment 60, 2-3 (March 2016), 7:1–7:12.
[14]
Google. 2019. Serverless Environment to Build and Connect Cloud
Services. https://cloud.google.com/functions/. (2019).
[15]
Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz
Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming
He. 2017. Accurate, large minibatch SGD: training imagenet in 1 hour.
arXiv preprint arXiv:1706.02677 (2017).
[16]
Johann Hauswald, Yiping Kang, Michael A. Laurenzano, Quan Chen,
Cheng Li, Ronald Dreslinski, Trevor Mudge, Jason Mars, and Lingjia
Tang. 2015. Djinn and Tonic: DNN as a Service and Its Implications for
Future Warehouse Scale Computers. In Proceedings of the 42nd Annual
International Symposium on Computer Architecture (ISCA 15).
[17]
Johann Hauswald, Michael A. Laurenzano, Yunqi Zhang, Cheng Li,
Austin Rovinski, Arjun Khurana, Ron Dreslinski, Trevor Mudge, Vini-
cius Petrucci, Lingjia Tang, and Jason Mars. 2015. Sirius: An Open
End-to-End Voice and Vision Personal Assistant and Its Implications
for Future Warehouse Scale Computers. In Proceedings of the Twenti-
eth International Conference on Architectural Support for Programming
Languages and Operating Systems (ASPLOS 15).
[18]
Yuxiong He, Sameh Elnikety, James Larus, and Chenyu Yan. 2012. Zeta:
Scheduling Interactive Services with Partial Execution. In Proceedings
of the Third ACM Symposium on Cloud Computing (SoCC 12).
[19]
Scott Hendrickson, Stephen Sturdevant, Tyler Harter, Venkateshwaran
Venkataramani, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-
Dusseau. 2016. Serverless Computation with OpenLambda. In 8th
USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 16).
[20]
IBM. 2019. IBM Cloud Functions. https://www.ibm.com/cloud/functions.
(2019).
[21]
Muhammad Hussain Iqbal and Tariq Rahim Soomro. 2015. Big data
analysis: Apache storm perspective. International journal of computer
trends and technology 19, 1 (2015), 9–14.
[22]
Virajith Jalaparti, Peter Bodik, Srikanth Kandula, Ishai Menache,
Mikhail Rybalkin, and Chenyu Yan. 2013. Speeding Up Distributed
Request-response Workflows. In Proceedings of the ACM SIGCOMM
2013 Conference on SIGCOMM.
[23]
Adwait Jog, Evgeny Bolotin, Zvika Guz, Mike Parker, Stephen W.
Keckler, Mahmut T. Kandemir, and Chita R. Das. 2014. Application-aware
Memory System for Fair and Efficient Execution of Concurrent
GPGPU Applications. In Proceedings of Workshop on General Purpose
Processing Using GPUs (GPGPU 14).
[24]
Evangelia Kalyvianaki, Marco Fiscato, Theodoros Salonidis, and Peter
Pietzuch. 2016. THEMIS: Fairness in Federated Stream Processing
Under Overload. In Proceedings of the 2016 International Conference on
Management of Data (SIGMOD 16).
[25]
Svilen Kanev, Juan Pablo Darago, Kim Hazelwood, Parthasarathy Ranganathan,
Tipp Moseley, Gu-Yeon Wei, and David Brooks. 2015. Profiling
a Warehouse-scale Computer. In Proceedings of the 42nd Annual
International Symposium on Computer Architecture (ISCA 15).
[26]
S. Kanev, K. Hazelwood, G. Y. Wei, and D. Brooks. 2014. Tradeoffs
between power management and tail latency in warehouse-scale appli-
cations. In IEEE International Symposium on Workload Characterization
(IISWC 14).
[27]
R. S. Kannan, A. Jain, M. A. Laurenzano, L. Tang, and J. Mars. 2018.
Proctor: Detecting and Investigating Interference in Shared Datacen-
ters. In 2018 IEEE International Symposium on Performance Analysis of
Systems and Software (ISPASS 18).
[28]
O. Kayiran, N. C. Nachiappan, A. Jog, R. Ausavarungnirun, M. T. Kan-
demir, G. H. Loh, O. Mutlu, and C. R. Das. 2014. Managing GPU
Concurrency in Heterogeneous Architectures. In 2014 47th Annual
IEEE/ACM International Symposium on Microarchitecture (MICRO 14).
[29]
Kris Kobylinski. 2015. Agile Software Development for Bluemix with
IBM DevOps Services. In Proceedings of the 25th Annual International
Conference on Computer Science and Software Engineering (CASCON
15).
[30]
Jason Mars and Lingjia Tang. 2013. Whare-map: Heterogeneity in
"Homogeneous" Warehouse-scale Computers. In Proceedings of the
40th Annual International Symposium on Computer Architecture (ISCA
13).
[31]
Jason Mars, Lingjia Tang, Robert Hundt, Kevin Skadron, and Mary Lou
Soffa. 2011. Bubble-Up: Increasing Utilization in Modern Warehouse
Scale Computers via Sensible Co-locations. In Proceedings of the 44th
Annual IEEE/ACM International Symposium on Microarchitecture (MI-
CRO 11).
[32]
Sean Marston, Zhi Li, Subhajyoti Bandyopadhyay, Juheng Zhang, and
Anand Ghalsasi. 2011. Cloud Computing - The Business Perspective.
Decis. Support Syst. 51, 1 (April 2011), 14.