Publications & Talks

Google Scholar

Google Scholar shows citations for academic and some other articles, but for technical articles, see below.

On Software and the Cloud

Cloud waste often hides in inefficient code, creating an “illusion of efficiency.” This article gives a systematic approach to finding cost issues that come from the code rather than cloud infrastructure.

• “Outside Advisors: Empowering your team while retaining ownership”, L8 Conference.

This talk, for an audience of Staff Engineers, introduces a pragmatic approach to incorporating external Cloud Reliability Engineering (CRE) advisors while keeping your team firmly in control of their technical destiny.

• “Taking Machine Learning to production: Cloud MLOps for speed and efficiency”.

The article version of my talk from DevFest and SRE Day (see below)

• “Cloud MLOps for speed and efficiency”, DevFest Bucharest and SRE Day London; earlier multicloud version given to the Israel NLP Meetup at Microsoft. Video of the DevFest talk and of the SRE Day talk.

I work with startups that build on an algorithm that they innovated, but hit a scaling wall as they succeed. I show how cloud MLOps platforms like Vertex provide a full spectrum of MLOps services, and services, and how to decide when to leverage each.

• “DoiT-Easily: Simplifying your Google Marketplace Vendor Development”

How to use the DoiT-Easily open-source project as a basis for developing your Google Marketplace offerings.

• “Sell your SaaS on Google Marketplace”

How to integrate your solution into Google Marketplace.

• “No WAFs: Don’t use a Web Application Firewall, and when you should, anyway” (Conference Talk). (Video). Bsides Strasbourg; also at Conf42 (Video )

WAFs look like a quick-fix for security problems, but generally lead to worse security. Yet sometimes they are needed. This article sorts that out.

• “No WAFs: Don’t use a Web Application Firewall, and when you should, anyway” (Article)

WAFs look like a quick-fix for security problems, but generally lead to worse security. Yet sometimes they are needed. This article sorts that out.

• “Controlling cloud costs: Where to start, and where to go from there”, StackExchange Podcast.

We explored the importance of controlling and understanding cloud costs, the role of good architecture in cost optimization, and strategies for dealing with surprise costs.

• “Level 7 Egress Control: Just now emerging in Kubernetes”, a full talk given at SRECon Dublin (video); and various short versions at DevOpsDays Vilna, DevOpsDays Ukraine, and Kubernasties Tel Aviv. See videos of the short talks in English; Yiddish “װי מען רעגולירט אַרױסגאַנג לױטן ראַיאָן: כטולהון אַרט עס נישט”; and Hebrew “שליטה על יציאה מאשכול על פי מתחם”.

Controlling outgoing traffic from a VPN or a K8s cluster based on Level 7 Domain address.

• “Outside advisors: A counter-intuitive approach to Customer Reliability Engineering”, DevOpsDays, Vilna, Lithuania; Kubernetes Users Group, Singapore. Video recording

How does SRE work with outside advisers? If outsiders run DevOps, costs mount quickly. There is a better way. I will show how strictly defined limits on what outside CRE does, lower cost, build in-house skills, and leverage outside expertise for both architecture and urgent response.

• “Flexibility in Vizier’s Black Box Optimization”

Further abilities of Vertex AI Vizier

• “The Advantages of Vizier’s Black Box Approach”

Comparing Vizier to other hyperoptimizers

• “Vertex AI Vizier for fewer repetitions of costly ML training “

Introducing Vizier and Black Box Optimization

• “Untangling the Tangled Cloud”, (video) Usenix SRECon Singapore.

How do you arrange virtual machines, databases, and other services into logical groups?

Whether with Google Cloud projects, AWS accounts, or Azure resource groups, my consulting customers find that either lumping all the resources together or parceling them out into tiny groups makes management, security, and cost analysis too difficult: It’s tough predicting the impact of a change.

In this talk, related to my article at Usenix :login;, I explain how I advise architects to make their infrastructure follow the logical boundaries of microservices and the organization.

• “FQDN Egress Control in Kubernetes”, TFiR.

Allowing access only to specific domains from your Kubernetes application; limiting this access to pods in certain namespaces or with certain labels.

• “Allow outgoing traffic by domain: FQDN Egress Control”

When you block your applications in a VPC from outbound connections, but want to allow access to just one domain, a regular firewall won’t do. This article describes various approaches, including old standbys, two Preview services in Google Cloud, and AWS services. It also discusses Kubernetes-aware solutions and mentions an upcoming Kubernetes standard.

• “Untangling the Cloud: A Principled Method for Grouping Cloud Resources”, ;login:, the Usenix technical journal.

How to draw technical borders to divide your cloud resources into groupings, such as Google Cloud projects, AWS accounts, or Azure resource groups.

• “Black Box Optimization with Vizier AI Vizier”, at multiple conferences including Google DevZone Day at Google Tel Aviv; Google Atlantis Summit; GeeCON Prague; DevFest Zagreb; Google Israel Cloud Summit; Osijek GDG. See the talk here

The first-ever talk or article about the new Black Box Optimization interaction pattern of Vertex Vizier, a new way to optimize across slow and expensive iterations.

• “Authentication between microservices. Is it really that hard?”, Google Cloud Medium Publication.

I’ve always found it difficult to securely authenticate between microservices. And in fact, though there are plenty of ways to do it, doing it securely is not easy.

• “Multitenant inference architecture with SageMaker endpoints”, Multicloud Engineering Meetup.

With Michael Pelts, Senior Solutions Architect at AWS, explore ways to allow tenants to expose their own models for inference when ensuring that it is accessed only by the tenant that created it.

• “Throughput Metrics Across the Clouds”

This article describes the conclusions from my open-source project, which is the first that I know of measuring throughput (most others focus on latency) and doing so across clouds and relating it to distance.

• “AWS Machine Learning at the start of 2022

My colleagues and I discuss the latest from AWS in Machine Learning and Artificial Intelligence.

• “Let the Computer Enforce It For You”

How to document your code for best results: Keep the documentation close to the code, and machine-enforced.

• “Implementing SaaS Tenant Isolation Using Amazon SageMaker Endpoints and IAM”

Machine Learning Software-as-a-Service providers that serve a lot of tenants need to keep these tenants prediction endpoints separate. Here are several ways to do that, balancing expense and robustness. We show an advanced use of Identity and Access management for a flexible access control to the Sagemaker inferencing endpoints.

• “Cloud Blaster: How to clean up your Google Cloud Project”

As you experiment with the Google Cloud, your project tends to get cluttered. This open-source project cleans it up.

• “AWS Firewalls: How and When to Use Each One”

When I saw that AWS had a new firewall with the uninformative name “Network Firewall,” I thought “not another one.” This is my attempt to sort them all out. Then Jeff Barr, AWS Chief Evangelist, tweeted it!

• “Kotlin, Gradle, and the Cloud”

How to build Kotlin apps in the cloud with Gradle.

• “Resource Labeling with Iris3”

My Iris 3 open-source project adds labels to resources like VM instances, Pub Sub Topics, and more, making for more detailed cost reports.

• “The Quickest Quickstarts”

When I want to use a new technology, I first want a script that gives me a working “Hello World”; I can tweak from there. This article presents nine such solutions for different compute infrastructures on AWS and GCP.

• “From Notebook to AWS’’

Step by step from a research Jupyter Notebook to a AWS distributed Machine Language Deployment.

• “Looking for an emulator for Google Cloud Tasks?”

Though an emulator is often requested, Google does not offer one for developing with Cloud Tasks, comparable to what if offers for Datastore or PubSub. I created one.

• “Safe Scrub: Clean up your Google Cloud projects.’’

An open-source tool I wrote for safely deleting the resources cluttering your projects.

• “The Hidden Costs of Datastore’’

Datastore export costs don’t show up in Google Cloud Monitoring. Here’s how to set up real-time alerts to better keep track of export costs.

• “You can handle pods, but what about clusters?’’

Introducing a new open-source tool that I wrote for cloning clusters between the Google, Azure, and Amazon clouds.

• “How Your Web App Can Serve the Chinese Market’’

For web app developers, serving users in China requires a completely different way of thinking. Here are the key steps you should take.

• “How to Best Prepare for your Cloud Certification Exam’’

The best preparation materials for Google and AWS Cloud Architect Certification as you get ready for those critical two or three hours of the exam.

• “Build on Your Experience to Earn Cloud Certifications’’

How to quickly absorb the knowledge that you don’t yet have, building on your professional strengths.

• “Google App Engine Flexible Environment: Beyond the Bounds of PaaS”. The first ever talk or article dedicated to this new Google PaaS. Multicloud Engineering,.

• “Breaking boundaries: How Freightos achieved high speed graph search in the cloud ,” CloudTech.

Running heavy-duty graph algorithms against a very large dataset require some unusual design principles. Freightos may not be the only company doing it, but no cloud platform today is optimized for this; in fact, the usual design assumptions in cloud platforms are quite the opposite of what we needed. Here is how we did it.

• “Modeling Retail Products: A Big Data Approach,” The Software Generalist.

• “ Search and You Shall Find ,” Medium.

Today’s e-tail search engines return inaccurate results; merchants stuff all product information into long titles. To optimize revenue, online retailers need a search engine that understands the product selection.

• “ Apache Spark and Java 8: The Big Data Team ,”, Datanami.

Apache Spark with Java 8 is proving to be the perfect match for Big Data. In this article, I show an example of collaborative filtering using Spark on Cassandra data, and explain how much easier this is to do with the lambdas of Java 8. Code to accompany it is here at GitHub.

• “Documents in the cloud: Dynamic, Privacy-customized views,” Cloudbook.

As documents move to the cloud, it becomes harder to protect the private information in them, but on the other hand becomes easier to control distribution of specific private information to exactly the people who are authorized to see it.

• “Flexible, Dynamic Redaction”, MasterDataManagement.com.

Complying with privacy regulations used to mean “redaction,” blacking out words with a pen, slowly and expensively. But natural language processing techniques can protect exactly the information regulated by law while giving convenient access to authorized users.

• “Breaking Walls: How to Get Departments to Share Information,” with Michael Pelts, Technology and Humans

• “People Who Live in Glass Houses Should Put Up Some Shades ,” InfoSecurity.

Too much openness, as well as too little, both pose risks. Document viewing with automated privacy control is one part of the balance. Allowing authorized users to retrieve the redacted information is another.

• “Privacy for the Deeper Web,” with Michael Pelts, Technology and Humans.

• “Openness and Privacy for Regulatory Compliance,” with Michael Pelts, Information on Demand Europe.

• “IBM Optim Data Redaction: Reconciling Openness with Privacy,” IBM White Paper.

The White Paper for the product which I launched in IBM.

• “Openness and Privacy,” with Michael Pelts, Security and Privacy Symposium.

• “Clojure: Challenge your Java Assumptions ,” JavaWorld.

The article is aimed at senior Java developers, encouraging them to learn more about this exciting language. A dialect of Lisp, Clojure runs on the JVM with excellent integration with Java, and provides new, improved solutions for the biggest challenge to programming languages today: concurrency.

• “Mining Meaning from Java Code with Java Data Mining API”, JavaOne, San Francisco. As with the earlier talk, the JavaOne committee gave this presentation the “Cool Stuff” award.

• “Mining for Services: Discovering Business Realities in Mainframe Metadata,” IMPACT, Las Vegas.

• “Finding Mashup Ingredients,” Web 2.0 and Beyond Summit.

• “Mining for Meaning: Discovering Business Realities in Mainframe Metadata,” Mainframe Executive.

To expose siloed mainframe functionality now locked up in siloed systems, it is essential to understand its business value. Automated classification technologies help make this happen.

• “Metadata Mining: Automated Semantic Classification for Service Repositories,” XML Conference, Boston.

• “Approaches for Modeling Metadata in XML”, industry experts panel, XML Conference, Boston

• “The Portal as People-Centric SOA,” MainSoft Corporation White Paper.

As a consultant for a leading provider of Java-.NET interoperability software, I wrote a white paper evangelizing the company’s IBM WebSphere Portal product, showing how it functions as a user-facing on-ramp to SOA.

• “The Hub and the Edge: Balancing the Responsibilities,” Architecture and Governance Magazine.

Architecture is as much about the organization as about technology. This article explains how to divide responsibilities in Service Oriented Integration to let each team do what they do best.

• “JRuby on Rails ,” JavaWorld.

The article explain Ruby on Rails to Java developers, comparing it to Java web frameworks. It presents an example based on JavaSpaces, which leverages Java from within the Rails application. Even those Java developers who do not adopt Rails will benefit from the design principles built into the framework, as well as the rapidly emerging concept of non-Java languages integrated with Java and the JVM.

• “Ruby for the Java World ,” JavaWorld.

Dynamic languages are rapidly gaining in popularity. Ruby in particular has attracted attention, with a big boost from the Ruby on Rails Web framework. In this article, I introduce Java programmers to Ruby, focusing on the similarities, differences, and connectivity between the two languages, and describing the value of JRuby on the Java-platform. The article got some buzz on the net, including from Frank Sommers at Artima.

• “Enterprise Semantics: Aligning Service Oriented Architecture with the Business ,” with Joram Borenstein, Web Services Journal.

A business-focused overview of the value that semantics bring to Service Oriented Architectures.

• “Business Processes: Connecting the Design-Time to the Run-time” (presentation here), Programming Languages and Development Environments Seminar, Haifa.

• “XMI and the Many Metamodels of Enterprise Metadata,” with Joram Borenstein, XML Conference, Atlanta.

• “Semantic Enterprise Systems Management for Program Trading,” with Simon de M. Walker, Securities Industry Middleware Council, New York.

• “Java Metadata and the Semantic Web,” JavaOne, San Francisco. Awarded “Intriguing and Unexpected: New and Cool” by the conference committee.

• “Aligning Business Process and Business Information Models: a Semantic Approach,” with Zvi Schreiber, Global Business Process Forum, London.

• “Metadata Management Converges with Business Modeling,” DAMA Symposium and Wilshire Meta-Data Conference, Los Angeles.

• “Semantic Information Management for Data Integration in the Enterprise,” Tech Target.

• “Semantic Information Management: Controlling Complexity with a Central Information Hub,” Securities Industry Middleware Council, New York.

• “Web Services: Trends Toward Adoption,” invited appearance at industry experts panel, EIDX/CompTIA, San Diego.

• “Know What Your Schemas Mean: Semantic Information Management for XML Assets,” XML Conference.

Schemas control the structure of information, But they don’t specify what a field means. Is that “salary” field monthly or annual? Semantic data management helps you keep track and avoid expensive mistakes.

• “Active Information Models for Data Transformation,” eAI Journal (later renamed to Business Integration Journal and Align Journal).

EAI gives O(n) complexity for connecting n applications on the network, but there remains an O(n2) complexity for integrating the message formats that the applications use as input and output. With an ontology-based approach, however, this too can be reduced to O(n).

• Semantic Discovery for Web Services,” with Joram Borenstein, Web Services Journal.

Web Services lookup with UDDI requires client and server to agree on the exact syntax of the interaction. Using the principles of ontology, providers can publish and clients can discover Services based on the desired functionality rather than the syntactic details.

• “Generating XSLT with a Semantic Hub,” XML Conference.

XLST was a promising XML technology that never fulfilled its promise because it was so hard to write and maintain. But when generated automatically from semantic information about what data is used for, XSLT becomes an automated information interchange language.

• “Information Quality through Semantic Models ,” Enterprise Data Forum, Pittsburgh.

• “Deploying Jini: HTTP Servers for the Dynamic Download of Code ,” JavaWorld.

I’ve found that once new Jini developers learn about the exciting distributed architecture, they often get bogged down by the challenge of simply configuring their system for development. They encounter a yet greater challenge in moving from the development configuration to deployment. Even experienced developers can get confused by the variety of components involved.

In the article, I review a number of solutions and explain the advantages of various solutions such as ease of development,ease of migration from development to deployment, low memory and CPU burden, portability, compatibility with RMI Activation, security, and enterprise-class web-app features.

• “Ontology: Automated Integration of Enterprise XML,” Web Services Edge, Santa Clara.

• “Building a Successful Wireless Web Site,” Wireless Business and Technology.

If you’re a software development manager with experience leading the development of a three-tier distributed application for the World Wide Web, perhaps you’re about to move on to spearhead the construction of a WAP-site. This article has reuse as its theme: I explain when you can reuse skillsets, infrastructure, and software components from the WWW site, and when you’re better off developing new skills, buying new infrastructure, or building new software.

• “When is a Singleton not a Singleton ,” JavaWorld (republished in Infoworld, and Sun/Oracle; appeared on the Java Developer Connection front page).

Sometimes you implement the Singleton Design Pattern, but mysteriously find that more than one object of the class is instantiated. This article explains how that can happen and how to avoid it.

• “Distributed Garbage Collection and Socket Option Keep-Alive,” JavaOne, San Francisco.

• “Opaque Bodies, Transparent Envelopes ,” XML-Journal.

Separating layers of abstraction by packaging a body of one layer in an envelope of another layer is one of the fundamental design principles in data transfer. This principle holds for XML just as for any data transfer format, but implementing a system that observes layer separation can be difficult. This article describes how to do it.

• “Developing Java Servlets.” Software Productivity Center, Vancouver, Canada.

• “So what is SO_KEEPALIVE? ,” Dr. Dobb’s Journal.

Garbage collecting distributed leases requires mechanisms such as keep-alives, heartbeats, leases, and Are-You-There/I-Hear-You protocols. Interestingly, the keep-alive mechanism built into TCP/IP sockets is not really practical; for this reason and the JDK didn’t allow access to Socket Option Keep-Alive until the recent release of JDK 1.3. I explain the problems with SO_KEEPALIVE and how to implement your own garbage collection mechanism for distributed resources.

• Seminars: Servlets, JSP, RMI and JDBC. Interbit Training and Consulting, a Sun Microsystems Training Center, Herzliya, Israel.

• “Collaborative Applications with the Java Shared Data Toolkit ,” Dr. Dobb’s Journal.

I describe and review a toolkit for allowing distributed applications to share objects, and more generally discuss the challenges of managing distributed objects. The JSDT was an official product from Sun, although it never made it to the status of a Java extension. It implemented some interesting ideas for distributing objects. I enjoyed collaborating with the creator, and some of my suggestions (like one on failure detection), actually made it into the toolkit.

Recruiting and Career

Working as a cloud consultant at DoiT International: About our unique work-style: I have never seen anything like this.
On informational interviews, from the employer’s perspective, at ERE, the leading site for tech recruiters.
It also appeared at Israel’s top news site for recruiters, gius.co.il.
On the game-theoretical perspective on kegerators at Pando.
On building open-source communities by recruiting top tech talent from among the members, at OpenSource.com.
Career tips for Generation Y professionals at Brazen Careerist.
The rise of the anti-recruiter at ReadWrite.
On a new approach to recruiting at Information Week, with Jack Perkins.
On effective employer branding at ERE.
On informational Interviews, from the professional’s perspective, at Business Insider.