Publications

2025

  1. No Two Snowflakes Are Alike: Studying eBPF Libraries’ Performance, Fidelity and Resource Usage. Machado, C., Gião, B., Amaro, S., Matos, M., Paulo, J., & Esteves, T. In Workshop on eBPF and Kernel Extensions (eBPF’25), colocated with SIGCOMM. ACM, 2025. (To appear)

  2. Keigo: Co-designing Log-Structured Merge Key-Value Stores with a Non-Volatile, Concurrency-aware Storage Hierarchy. Adão, R., Wu, Z., Zhou, C., Balmau, O., Paulo, J., & Macedo, R. Proceedings of the VLDB Endowment. 2025. (Core Rank A* and Q1 on Scimago)

  3. Keigo: Co-designing Log-Structured Merge Key-Value Stores with a Non-Volatile, Concurrency-aware Storage Hierarchy (Extended Version). Adão, R., Wu, Z., Zhou, C., Balmau, O., Paulo, J., & Macedo, R. ArXiv - Computing Research Repository (CoRR), 2506.14630, 2025.

  4. Exploiting Trusted Execution Environments And Distributed Computation for Genomic Association Tests. Brito, C., Ferreira, P., & Paulo, J. IEEE Journal of Biomedical and Health Informatics. 2025. (Q1 on Scimago)

  5. Promoting Sustainable And Personalized Travel Behaviors While Preserving Data Privacy. Brito, C., Pina, N., Esteves, T., Vitorino, R., Cunha, I., & Paulo, J. Transportation Engineering. 2025. (Q1 on Scimago)

2024

  1. When Amnesia Strikes: Understanding And Reproducing Data Loss Bugs With Fault Injection. Ramos, M., Azevedo, J., Kingsbury, K., Pereira, J., Esteves, T., Macedo, R., & Paulo, J. Proceedings of the VLDB Endowment. 2024. (Core Rank A* and Q1 on Scimago)

  2. Can Current SDS Controllers Scale To Modern HPC Infrastructures? Miranda, M., Tanimura, Y., Haga, J., Ruhela, A., Harrell, S., Cazes, J., Macedo, R., Pereira, J., & Paulo, J. In Workshop of Intel Extreme Performance Users Group (IXPUG), colocated with SC. IEEE, 2024.

2023

  1. Privacy-Preserving Machine Learning on Apache Spark. Brito, C., Ferreira, P., Portela, B., Oliveira, R. C., & Paulo, J. IEEE Access. 2023. (Q1 on Scimago)

  2. Toward A Practical And Timely Diagnosis of Application’s I/O Behavior. Esteves, T., Macedo, R., Oliveira, R. C., & Paulo, J. IEEE Access. 2023. (Q1 on Scimago)

  3. CRIBA: A Tool For Comprehensive Analysis Of Cryptographic Ransomware’s I/O Behavior. Esteves, T., Pereira, B., Oliveira, R. P., Marco, J., & Paulo, J. In 42nd International Symposium on Reliable Distributed Systems (SRDS). IEEE, 2023.

  4. Diagnosing Applications’ I/O Behavior Through System Call Observability. Esteves, T., Macedo, R., Oliveira, R. C., & Paulo, J. In Workshop on Data-Centric Dependability and Security (DCDS), colocated with DSN. IEEE, 2023.

  5. Diagnosing applications’ I/O behavior through system call observability. Esteves, T., Macedo, R., Oliveira, R., & Paulo, J. ArXiv - Computing Research Repository (CoRR), 2304.08569, 2023.

  6. Taming Metadata-Intensive HPC Jobs Through Dynamic, Application-Agnostic QoS Control. Macedo, R., Miranda, M., Tanimura, Y., Haga, J., Ruhela, A., Harrell, S., Evans, R., Pereira, J., & Paulo, J. In IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid). IEEE, 2023. (Core B)

  7. PADLL: Taming Metadata-intensive HPC Jobs Through Dynamic, Application-agnostic QoS Control. Macedo, R., Miranda, M., Tanimura, Y., Haga, J., Ruhela, A., Harrell, S., Evans, R., Pereira, J., & Paulo, J. ArXiv - Computing Research Repository (CoRR), 2302.06418, 2023.

  8. SOTERIA: Preserving Privacy in Distributed Machine Learning. Brito, C., Ferreira, P., Portela, B., Oliveira, R. C., & Paulo, J. In Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing (SAC). ACM, 2023.

  9. Understanding storage I/O patterns through system call observability. Esteves, T., Macedo, R., Oliveira, R., & Paulo, J. European Conference on Computer Systems (EuroSys). 2023. (Poster)

  10. Emission-Aware Federated Learning: A Case Study on Transportation and Carbon Footprint. Brito, C., Pina, N., Vitorino, R., Cunha, I., & Paulo, J. European Conference on Computer Systems (EuroSys). 2023. (Poster)

  11. PADLL: Taming Metadata Burstiness of HPC Jobs Through Application-level QoS Control. Macedo, R., Miranda, M., Tanimura, Y., Haga, J., Ruhela, A., Harrell, S., Evans, R., Pereira, J., & Paulo, J. European Conference on Computer Systems (EuroSys). 2023. (Poster)

2022

  1. Protecting Metadata Servers From Harm Through Application-level I/O Control. Macedo, R., Miranda, M., Tanimura, Y., Haga, J., Ruhela, A., Harrell, S., Evans, R., & Paulo, J. In Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads (REX-IO), colocated with CLUSTER. IEEE, 2022.

  2. Accelerating Deep Learning Training Through Transparent Storage Tiering. Dantas, M., Leitão, D., Cui, P., Macedo, R., Liu, X., Xu, W., & Paulo, J. In 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid). IEEE, 2022. (Core A)

  3. PAIO: General, Portable I/O Optimizations With Minor Application Modifications. Macedo, R., Tanimura, Y., Haga, J., Chidambaram, V., Pereira, J., & Paulo, J. In 20th USENIX Conference on File and Storage Technologies (FAST). USENIX, 2022. (Core A)

2021

  1. CaT: Content-Aware Tracing and Analysis for Distributed Systems. Esteves, T., Neves, F., Oliveira, R. C., & Paulo, J. In 22nd International Middleware Conference. ACM, 2021. (Core A)

  2. Pods-as-Volumes: Effortlessly Integrating Storage Systems and Middleware into Kubernetes. Faria, A., Macedo, R., & Paulo, J. In Workshop on Container Technologies and Container Clouds (WoC’21). 2021.

  3. Monarch: Hierarchical Storage Management for Deep Learning Frameworks. Dantas, M., Leitão, D., Correia, C., Macedo, R., Xu, W., & Paulo, J. In Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads (REX-IO). 2021.

  4. The Case for Storage Optimization Decoupling in Deep Learning Frameworks. Macedo, R., Correia, C., Dantas, M., Brito, C., Xu, W., Tanimura, Y., Haga, J., & Paulo, J. In Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads (REX-IO). 2021.

  5. ATOCS - Automatic Configuration of Secure Databases. Ferreira, D., Paulo, J., & Matos, M. In European Dependable Computing Conference (EDCC). 2021.

  6. BDUS: Implementing Block Devices in User Space. Faria, A., Macedo, R., Pereira, J., & Paulo, J. In International Systems and Storage Conference (SYSTOR). 2021. (Best paper runner up)

  7. S2Dedup: SGX-enabled Secure Deduplication. Miranda, M., Esteves, T., Portela, B., & Paulo, J. In International Systems and Storage Conference (SYSTOR). 2021. (Best paper runner up)

  8. Realistic Assessment of Faults in Storage Systems. Borges, C., & Paulo, J. In Workshop on High-Performance and Reliable Big Data (HPBD). 2021.

  9. Geolocate: A geolocation-aware scheduling system for Edge Computing. Vilaça, J., Paulo, J., & Vilaça, R. In Workshop on High-Performance and Reliable Big Data (HPBD). 2021.

  10. Soteria: Privacy-Preserving Machine Learning for Apache Spark. Brito, C., Ferreira, P., Portela, B., Oliveira, R., & Paulo, J. Cryptology ePrint Archive, Report 2021/966, 2021.

  11. PAIO: A Software-Defined Storage Data Plane Framework. Macedo, R., Tanimura, Y., Haga, J., Chidambaram, V., Pereira, J., & Paulo, J. ArXiv - Computing Research Repository (CoRR), 2106.03617, 2021.

2020

  1. On the Trade-Offs of Combining Multiple Secure Processing Primitives for Data Analytics. Carvalho, H., Cruz, D., Pontes, R., Paulo, J., & Oliveira, R. In Distributed Applications and Interoperable Systems (DAIS). 2020. (Core B)

  2. GenoDedup: Similarity-Based Deduplication and Delta-Encoding for Genome Sequencing Data. Cogo, V., Paulo, J., & Bessani, A. IEEE Transactions on Computers. 2020. (Q1 in Scimago)

  3. A Survey and Classification of Software-Defined Storage Systems. Macedo, R., Paulo, J., Pereira, J., & Bessani, A. ACM Computing Surveys. 2020. (Q1 in Scimago)

2019

  1. A Case for Dynamically Programmable Storage Background Tasks. Macedo, R., Faria, A., Paulo, J., & Pereira, J. In Symposium on Reliable Distributed Systems Workshops (SRDSW). 2019.

  2. TrustFS: An SGX-Enabled Stackable File System Framework. Esteves, T., Macedo, R., Faria, A., Portela, B., Paulo, J., Pereira, J., & Harnik, D. In Symposium on Reliable Distributed Systems Workshops (SRDSW). 2019.

  3. SafeSpark: Plataforma de Processamento Analítico Seguro. Carvalho, H., Cruz, D., Pontes, R., Paulo, J., & Oliveira, R. In INFOrum - Simpósio de Informática. 2019.

2017

  1. SafeFS: A Modular Architecture for Secure User-Space File Systems (One FUSE to rule them all). Pontes, R., Burihabwa, D., Maia, F., Paulo, J., Schiavoni, V., Felber, P., Mercier, H., & Oliveira, R. In International Systems and Storage Conference (SYSTOR). 2017. (Best Student Paper Award)

  2. A Practical Framework for Privacy-Preserving NoSQL Databases. Macedo, R., Paulo, J., R., P., Portela, B., Oliveira, T., Matos, M., & Oliveira, R. In Symposium on Reliable Distributed Systems (SRDS). 2017. (Core A)

  3. DDFlasks: Deduplicated Very Large Scale Data Store. Maia, F., Paulo, J., Coelho, F., Neves, F., Pereira, J., & Oliveira, R. In Distributed Applications and Interoperable Systems (DAIS). 2017. (Core B)

  4. HTAPBench: Hybrid Transactional and Analytical Processing Benchmark. Coelho, F., Paulo, J., Vilaça, R., Pereira, J., & Oliveira, R. In International Conference on Performance Engineering (ICPE). 2017. (Core B)

2016

  1. Efficient Deduplication in a Distributed Primary Storage Infrastructure. Paulo, J., & Pereira, J. ACM Transactions on Storage. 2016. (Q1 in Scimago)

  2. On the Cost of Safe Storage for Public Clouds: an Experimental Evaluation. Burihabwa, D., Pontes, R., Felber, P., Maia, F., Mercier, H., Oliveira, R., Paulo, J., & Schiavoni, V. In Symposium on Reliable Distributed Systems (SRDS). 2016. (Core A)

  3. SafeRegions: Performance Evaluation of Multi-party Protocols on HBase. Pontes, R., Maia, F., Paulo, J., & Vilaça, R. In Symposium on Reliable Distributed Systems Workshops (SRDSW). 2016.

  4. Resource Usage Prediction in Distributed Key-Value Datastores. Cruz, F., Maia, F., Matos, M., Oliveira, R., Paulo, J., Pereira, J., & Vilaça, R. In Distributed Applications and Interoperable Systems (DAIS). 2016. (Core B)

2015

  1. Dependable Decentralized Storage Management for Cloud Computing (Ph.D. thesis). Paulo, J. University of Minho, Braga, Portugal, 2015. (Supervision: José Orlando Pereira)

2014

  1. A Survey and Classification of Storage Deduplication Systems. Paulo, J., & Pereira, J. ACM Computing Surveys. 2014. (Q1 in Scimago)

  2. Distributed Exact Deduplication for Primary Storage Infrastructures. Paulo, J., & Pereira, J. In Distributed Applications and Interoperable Systems (DAIS). 2014. (Core B)

2013

  1. Towards an accurate evaluation of deduplicated storage systems. Paulo, J., Reis, P., Pereira, J., & Sousa, A. International Journal of Computer Systems Science and Engineering. 2013.

  2. MET: Workload aware elasticity for NoSQL. Cruz, F., Maia, F., Matos, M., Oliveira, R., Paulo, J., Pereira, J., & Vilaça, R. In European Conference on Computer Systems (EuroSys). 2013. (Core A)

  3. DEDIS: Distributed Exact Deduplication for Primary Storage Infrastructures. Paulo, J., & Pereira, J. In Symposium on Cloud Computing (SOCC). 2013. (Fast Abstract)

  4. DEDIS: Exact Deduplication for Primary Distributed Storage. Paulo, J., & Pereira, J. In European Conference on Computer Systems (EuroSys). 2013. (Poster)

2012

  1. DEDISbench: A Benchmark for Deduplicated Storage Systems. Paulo, J., Reis, P., Pereira, J., & Sousa, A. In International Symposium on Secure Virtual Infrastructures (DOA-SVI). 2012.

2011

  1. Model Checking a Decentralized Storage Deduplication Protocol. Paulo, J., & Pereira, J. In Latin-American Symposium on Dependable Computing (LADC). 2011. (Fast Abstract)

2009

  1. Efficient Storage of Data in Cloud Computing (MSc thesis). Paulo, J. University of Minho, Braga, Portugal, 2009. (Supervision: José Orlando Pereira)