========================
Pooling with `pylibmc`
========================

:Author: Ludvig Ericson <ludvig circled-a lericson dot se>

.. This is really a blog post; I do write them in ReST occasionally. Provided
   here for the sake of convenience.

I was discussing how to implement pooling for `pylibmc` when I realized what
`libmemcachedutil`'s pooling is -- or rather, what it isn't.

It's not a magical solution for doing anything at all concurrently; it just
helps you with thread-safety.

In Python, however, we've got the global interpreter lock, the GIL. This lock
must always be held by the thread that is dealing with anything Python. The
Python interpreter itself isn't thread-safe -- or rather, it is, but only by
virtue of the GIL.

This means that whenever Python code is running, you're sure to have exclusive
access to all of Python's memory (unless something is misbehaving). In turn,
this means that the use case for `libmemcachedutil` in a Python library is
rather slim.

Let's have a look at some code for doing the equivalent in pure Python. This is
a Werkzeug-based WSGI application which would be run in multiple threads,
concurrently:

.. code-block:: python

    # Configuration
    n_threads = 12
    mc_addrs = "10.0.1.1", "10.0.1.2", "10.0.1.3"
    mc_pool_size = n_threads

    # Application
    import pylibmc
    from contextlib import contextmanager
    from pprint import pformat
    from werkzeug.wrappers import Request, Response
    from werkzeug.exceptions import NotFound

    class ClientPool(list):
        # list.pop and list.append are atomic thanks to the GIL, so the
        # pool needs no explicit locking. Popping from an exhausted pool
        # would raise IndexError, which can't happen as long as
        # mc_pool_size is at least n_threads.
        @contextmanager
        def reserve(self):
            mc = self.pop()
            try:
                yield mc
            finally:
                self.append(mc)

    mc = pylibmc.Client(mc_addrs)
    mc_pool = ClientPool(mc.clone() for _ in range(mc_pool_size))

    @Request.application
    def my_app(request):
        with mc_pool.reserve() as mc:
            key = request.path[1:].encode("ascii")
            val = mc.get(key)
            if not val:
                return NotFound(key)
            return Response(pformat(val))

It's a fully functional example of how one could implement pooling with
`pylibmc`, very much in the same way that people do with `libmemcachedutil`.
To start it, you could use Spawning like so:
``spawn -s 1 -t 12 my_wsgi_app.my_app``

I don't know if the above methodology is the best one, though. Another
possibility is to have a dict with thread names as keys and client objects as
values: each thread looks up its own client object in the dict on each
request, and if none exists, it clones a master, just like the pooling thing
above. A sketch of that variant follows below.
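
Here's a minimal sketch of that per-thread approach, reusing `mc_addrs` from
the example above. The class name `ThreadMappedPool` and the choice of
`threading.get_ident()` as the per-thread key are illustrative assumptions on
my part, not an established API:

.. code-block:: python

    import threading
    from contextlib import contextmanager

    class ThreadMappedPool(dict):
        # Each thread only ever touches its own entry, and individual
        # dict operations are atomic under the GIL, so again no locking.
        def __init__(self, master):
            super().__init__()
            self.master = master

        @contextmanager
        def reserve(self):
            key = threading.get_ident()
            if key not in self:
                # First use on this thread: clone the master client.
                self[key] = self.master.clone()
            yield self[key]

    mc_pool = ThreadMappedPool(pylibmc.Client(mc_addrs))

One caveat with this variant: clients belonging to threads that have exited
are never reclaimed, so it fits best with a fixed-size thread pool.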

It'd be neat if there were a generic Python API for doing any variant of
pooling, per-thread or the list-based version, and then you'd be able to
switch between them seamlessly. Hm.
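
For what it's worth, the two sketches above already share one such interface:
anything written against ``reserve()`` doesn't care which kind of pool it got.
Assuming the names from the examples above (``use_thread_mapped`` is just a
hypothetical toggle):

.. code-block:: python

    # Both pools expose reserve() as a context manager, so my_app works
    # unchanged with either; picking a strategy is a one-line change.
    use_thread_mapped = False

    if use_thread_mapped:
        mc_pool = ThreadMappedPool(mc)
    else:
        mc_pool = ClientPool(mc.clone() for _ in range(mc_pool_size))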