Deleting¶
Description
Deleting content items in Plone programmatically. How link integrity checks work and how (and when!) to avoid them.
Introduction¶
This document explains how to programmatically delete objects in Plone.
Deleting content by id¶
Deleting content objects is done by IObjectManager.
Example:
# manage_delObjects takes list of ids as an argument
folder.manage_delObjects(["list", "of", "ids", "to", "delete"])
Or:
parent = context.aq_parent
parent.manage_delObjects([context.getId()])
Permissions¶
The user must have Zope 2
Delete objects permission on the
content item being deleted. This is checked in
Products.CMFPlone.PloneFolder.manage_delObjects()
.
Otherwise an
Unauthorized
exception is raised.
Example how to check for this permission:
from Products.CMFCore import permissions
hospital = self.portal.country.hospital
item = hospital.patient1
mt = getToolByName(self.portal, 'portal_membership')
if mt.checkPermission(permissions.DeleteObjects, item):
# Can delete
raise AssertionError("Oooops. Deletion allowed")
else:
pass
Bypassing permissions¶
This is handy if you work e.g. in a debug shell and you are deleting badly behaved objects:
from AccessControl.SecurityManagement import newSecurityManager
admin = app.acl_users.getUserById("admin")
app.folder_sits.sitsngta.manage_delObjects("examples")
# Try harder:
# app.folder_sits.sitsngta._delObject("examples", suppress_events=True)
import transaction ; transaction.commit()
Deleting all content in a folder¶
This can be a bit tricky. An example:
ids = folder.objectIds() # Plone 3 or older
ids = folder.keys() # Plone 4 or newer
if len(ids) > 0:
# manage_delObject will mutate the list
# so we cannot give it tuple returned by objectIds()
ids = list(ids)
folder.manage_delObjects(ids)
Bypassing link integrity check¶
If link integrity checks has been enabled in the site setup, you cannot delete objects which themselves are link targets or if their children are link targets.
Instead, a
LinkIntegrityException
will be raised. The
LinkIntegrityException
contains information identifying the content which could
not be deleted.
The
plone.app.linkintegrity.browser.remote
module contains code which allows you to delete the object
in any case. It catches the exception, modifies the HTTP
request to contain a marker interface allowing delete to
happen and then replays the transaction.
In case the link integrity check fails for
manage_delObjects()
, you will be shown a confirmation dialog. The original
request payload gets pickled and is stored encoded in the
HTML form.
When the user presses confirm, the original request gets
unpickled from the
HTTP
POST
payload. Then the view modifies the Zope publisher so that
it will play the original unpickled
HTTP
POST
with the marker interface "Do not care about link
integrity breaches" turned on.
Here is an sample batch delete code which tries to work around the link integrity check:
from zope.component import queryUtility
from Products.CMFCore.interfaces import IPropertiesTool
# We need to disable link integrity check,
# because it cannot handle several delete calls in
# one request
ptool = queryUtility(IPropertiesTool)
props = getattr(ptool, 'site_properties', None)
old_check = props.getProperty('enable_link_integrity_checks', False)
props.enable_link_integrity_checks = False
for b in items:
count += 1
obj = b.getObject()
logger.info("Deleting:" + obj.absolute_url() + " " + str(obj.created()))
try:
obj.aq_parent.manage_delObjects([obj.getId()])
except Exception, e:
# E.g. linkintegrityerror or some other
logger.error("Could not remove item:" + obj.absolute_url())
logger.exception(e)
continue
if count % transaction_threshold == 0:
# Prevent transaction becoming too large (memory buffer)
# by committing now and then
logger.info("Committing transaction")
transaction.commit()
props.enable_link_integrity_checks = old_check
logger.info(msg)
Fail safe deleting¶
Sometimes deletion might fail because it dispatches events which might raise exception due to bad broken objects or badly behaving code.
OFS.ObjectManager, the base class for Zope folders, provides an internal method to delete objects from a folder without firing any events:
# Delete object with id "broken-folder" without firing any delete events
site._delObject("broken-folder", suppress_events=True)
The best way to clean up bad objects on your site is via a command line script, in which case remember to commit the transaction after removing the broken objects.
Purging site from old content¶
This term:ZMI script allows you to find content items of certain type and delete them if they are created too long ago:
# Delete FeedfeederItem content items which are more than three months old
from StringIO import StringIO
import DateTime
buf = StringIO()
# DateTime deltas are days as floating points
end = DateTime.DateTime() - 30*3
start = DateTime.DateTime(2000, 1,1)
date_range_query = { 'query':(start,end), 'range': 'min:max'}
items = context.portal_catalog.queryCatalog({
"portal_type":"FeedFeederItem",
"created" : date_range_query,
"sort_on" : "created" })
items = list(items)
print >> buf, "Found %d items to be purged" % len(items)
count = 0
for b in items:
count += 1
obj = b.getObject()
print >> buf, "Deleting:" + obj.absolute_url() + " " + str(obj.created())
obj.aq_parent.manage_delObjects([obj.getId()])
return buf.getvalue()
Below is an advanced version for old item-date-based
deletion code which is suitable for huge sites. This
snippet is from the
Products.feedfeeder
package. It will look for
Feedfeeder
items (automatically generated from RSS) which are older
than X days and delete them.
It's based on Zope 3 page registration (sidenote: I noticed that views do not need to be based on BrowserView page class).
- Transaction thresholds make sure the code runs faster and does not run out of RAM
- Logging to Plone event log files
- Number of days to look into past is not hardcoded
- Manage rights needed to execute the code
You can call this view like:
http://localhost:9999/plonecommunity/@@feed-mega-cleanup?days=90
Here is the view Python source code:
import logging
import transaction
from zope import interface
from zope import component
import DateTime
import zExceptions
logger = logging.getLogger("feedfeeder")
class MegaClean(object):
""" Clean-up old feed items by deleting them on the site.
This is intended to be called from cron weekly.
"""
def __init__(self, context, request):
self.context = context
self.request = request
def clean(self, days, transaction_threshold=100):
""" Perform the clean-up by looking old objects and deleting them.
Commit ZODB transaction for every N objects to that commit buffer does not grow
too long (timewise, memory wise).
@param days: if item has been created before than this many days ago it is deleted
@param transaction_threshold: How often we commit - for every nth item
"""
logger.info("Beginning feed clean up process")
context = self.context.aq_inner
count = 0
# DateTime deltas are days as floating points
end = DateTime.DateTime() - days
start = DateTime.DateTime(2000, 1,1)
date_range_query = {'query':(start,end), 'range': 'min:max'}
items = context.portal_catalog.queryCatalog({
"portal_type": "FeedFeederItem",
"created": date_range_query,
"sort_on": "created" })
items = list(items)
logger.info("Found %d items to be purged" % len(items))
for b in items:
count += 1
obj = b.getObject()
logger.info("Deleting:" + obj.absolute_url() + " " + str(obj.created()))
obj.aq_parent.manage_delObjects([obj.getId()])
if count % transaction_threshold == 0:
# Prevent transaction becoming too large (memory buffer)
# by committing now and then
logger.info("Committing transaction")
transaction.commit()
msg = "Total %d items removed" % count
logger.info(msg)
return msg
def __call__(self):
days = self.request.form.get("days", None)
if not days:
raise zExceptions.InternalError("Bad input. Please give days=60 as HTTP GET query parameter")
days = int(days)
return self.clean(days)
Then we have the view ZCML registration:
<page
name="feed-mega-cleanup"
for="Products.CMFCore.interfaces.ISiteRoot"
permission="cmf.ManagePortal"
class=".feed.MegaClean"
/>