<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Cassandra Internals &#8211; Writing</title>
	<atom:link href="http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/</link>
	<description>On Ruby, software and the Internet</description>
	<lastBuildDate>Tue, 31 Aug 2010 20:32:50 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Cassandra内核介绍&#8211;写操作 &#171; a db thinker&#39;s home</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-656</link>
		<dc:creator>Cassandra内核介绍&#8211;写操作 &#171; a db thinker&#39;s home</dc:creator>
		<pubDate>Sun, 30 May 2010 04:03:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-656</guid>
		<description>[...]  Cassandra内核介绍&#8211;写操作 by Mike Perham Translated by [...]</description>
		<content:encoded><![CDATA[<p>[...]  Cassandra内核介绍&#8211;写操作 by Mike Perham Translated by [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jametong</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-652</link>
		<dc:creator>jametong</dc:creator>
		<pubDate>Sat, 29 May 2010 06:38:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-652</guid>
		<description>Oracle has use dbwr/lgwr/ckpt systm processes do similar jobs as Cassandra does. I refer it as change sync jobs to async jobs, and change random i/o to sequential i/o. Cassandra does it both in Commit log and SSTable.</description>
		<content:encoded><![CDATA[<p>Oracle has use dbwr/lgwr/ckpt systm processes do similar jobs as Cassandra does. I refer it as change sync jobs to async jobs, and change random i/o to sequential i/o. Cassandra does it both in Commit log and SSTable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Lihn</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-640</link>
		<dc:creator>Steve Lihn</dc:creator>
		<pubDate>Mon, 24 May 2010 01:55:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-640</guid>
		<description>Nice article. Just curious when you said &quot;Unlike a database, Cassandra does not update data in-place on disk, nor update indices, so there’s no intensive synchronous disk operations to block the write.&quot;, which database are you referring to? Oracle can modify data blocks in memory and only write the redo log to disk, then move on. It seems to be Cassandra is similar to what Oracle is doing in this respect, isn&#039;t it?</description>
		<content:encoded><![CDATA[<p>Nice article. Just curious when you said &#8220;Unlike a database, Cassandra does not update data in-place on disk, nor update indices, so there’s no intensive synchronous disk operations to block the write.&#8221;, which database are you referring to? Oracle can modify data blocks in memory and only write the redo log to disk, then move on. It seems to be Cassandra is similar to what Oracle is doing in this respect, isn&#8217;t it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: OCTO talks ! &#187; no:sql(eu) et NoSQL : qu&#8217;en est il?</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-594</link>
		<dc:creator>OCTO talks ! &#187; no:sql(eu) et NoSQL : qu&#8217;en est il?</dc:creator>
		<pubDate>Thu, 22 Apr 2010 22:00:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-594</guid>
		<description>[...] - NoSQL concerne l&#8217;utilisation de la donnée et nous rappelle qu&#8217;un point important &#8211; et je pense souvent oublié dans nos SIs traditionnels &#8211; est de comprendre comment la donnée est utilisée. Un système peut vivre avec de la donnée inconsistante. En fait, c&#8217;est déjà le cas si vous utilisez les mécanismes de réplication Master/Slave, du cache ou simplement l&#8217;utilisation de messages asynchrones. Juste se demander comment sera utiliser la donnée.   Que se passera-t-il du point de vue du métier si mon opération (disons un retrait) est consistant mais que mon solde reste inconsistant pendant quelques secondes ou même quelques minutes?  Ma donnée sera-t-elle accéder de manière concurrente? Y aura-t-il une forte probabilité de conflit sur cette donnée? En cas de conflit, de quelle stratégie de &#171;&#160;read-repair&#160;&#187; aurai-je besoin (un modèle type Vector Clock ou plus simplement l&#8217;approche timestamp de Cassandra)?   Combien de temps y a t-il entre le moment ou j&#8217;écris ma donnée et le moment où elle est lue? Du coup, quelle est la probabilité de &#171;&#160;Stale State&#160;&#187;? S&#8217;il y a un une heure entre les deux opérations, la probabilité est faible&#8230; Ma donnée est-elle plutôt utilisée en écriture ou en lecture? Certains de ces systèmes ont été optimisés pour l&#8217;écriture. Par exemple, Cassandra réalise plus d&#8217;accès disque pour une lecture que pour une écriture. [...]</description>
		<content:encoded><![CDATA[<p>[...] &#8211; NoSQL concerne l&#8217;utilisation de la donnée et nous rappelle qu&#8217;un point important &#8211; et je pense souvent oublié dans nos SIs traditionnels &#8211; est de comprendre comment la donnée est utilisée. Un système peut vivre avec de la donnée inconsistante. En fait, c&#8217;est déjà le cas si vous utilisez les mécanismes de réplication Master/Slave, du cache ou simplement l&#8217;utilisation de messages asynchrones. Juste se demander comment sera utiliser la donnée.   Que se passera-t-il du point de vue du métier si mon opération (disons un retrait) est consistant mais que mon solde reste inconsistant pendant quelques secondes ou même quelques minutes?  Ma donnée sera-t-elle accéder de manière concurrente? Y aura-t-il une forte probabilité de conflit sur cette donnée? En cas de conflit, de quelle stratégie de &laquo;&nbsp;read-repair&nbsp;&raquo; aurai-je besoin (un modèle type Vector Clock ou plus simplement l&#8217;approche timestamp de Cassandra)?   Combien de temps y a t-il entre le moment ou j&#8217;écris ma donnée et le moment où elle est lue? Du coup, quelle est la probabilité de &laquo;&nbsp;Stale State&nbsp;&raquo;? S&#8217;il y a un une heure entre les deux opérations, la probabilité est faible&#8230; Ma donnée est-elle plutôt utilisée en écriture ou en lecture? Certains de ces systèmes ont été optimisés pour l&#8217;écriture. Par exemple, Cassandra réalise plus d&#8217;accès disque pour une lecture que pour une écriture. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: OCTO talks ! &#187; no:sql(eu) and NoSQL: What’s the deal?</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-593</link>
		<dc:creator>OCTO talks ! &#187; no:sql(eu) and NoSQL: What’s the deal?</dc:creator>
		<pubDate>Thu, 22 Apr 2010 17:47:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-593</guid>
		<description>[...] - NoSQL is about data usages and reminds us that the most important &#8211; and I guess the forgotten part in our traditional systems &#8211; is to understand how data is used. A system can live with inconsistent data (in fact this is already the case if you are using database Master/Slave replication, caching or asynchronous approaches). This is not a big deal in most of the cases but we need to keep that in mind (I must admit that we, as architects, often forget about this concern since we have been using RDBMS and ACID transactions for a long time).   What happens for the business if an operation (let&#8217;s say a withdrawal) is consistent but my balance stays inconsistent a couple of seconds or even a couple of minutes?   Will my data be accessed in a highly concurrent manner and so do I have a high probability to get conflicts on that data? In case of conflicts, do I need complex conflict resolutions strategies for read-repair? more specifically, do I need a Vector Clock model or a much simpler but also efficient mechanism based on timestamps like in Cassandra?   How many revisions of my objects do I need to keep? For instance, Riak stores revisions of objects that can be used for conflicts resolutions but the number of revisions you need to keep has to be correlated to the probability of having a conflict on that data.   How much time do I have between the moment my data is written and the moment that same data is read? And what is the probability of being in an inconsistent state? If there is one hour between the write and the read, the probability is quite low.  Is my data mainly used for reads or writes ? Some of these systems have been optimized for writes and less for reads. For instance Cassandra does many more disk accesses during a read than during a write  [...]</description>
		<content:encoded><![CDATA[<p>[...] &#8211; NoSQL is about data usages and reminds us that the most important &#8211; and I guess the forgotten part in our traditional systems &#8211; is to understand how data is used. A system can live with inconsistent data (in fact this is already the case if you are using database Master/Slave replication, caching or asynchronous approaches). This is not a big deal in most of the cases but we need to keep that in mind (I must admit that we, as architects, often forget about this concern since we have been using RDBMS and ACID transactions for a long time).   What happens for the business if an operation (let&#8217;s say a withdrawal) is consistent but my balance stays inconsistent a couple of seconds or even a couple of minutes?   Will my data be accessed in a highly concurrent manner and so do I have a high probability to get conflicts on that data? In case of conflicts, do I need complex conflict resolutions strategies for read-repair? more specifically, do I need a Vector Clock model or a much simpler but also efficient mechanism based on timestamps like in Cassandra?   How many revisions of my objects do I need to keep? For instance, Riak stores revisions of objects that can be used for conflicts resolutions but the number of revisions you need to keep has to be correlated to the probability of having a conflict on that data.   How much time do I have between the moment my data is written and the moment that same data is read? And what is the probability of being in an inconsistent state? If there is one hour between the write and the read, the probability is quite low.  Is my data mainly used for reads or writes ? Some of these systems have been optimized for writes and less for reads. For instance Cassandra does many more disk accesses during a read than during a write  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cheatsheet: 2010 04.01 ~ 04.07 - gOODiDEA.NET</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-554</link>
		<dc:creator>Cheatsheet: 2010 04.01 ~ 04.07 - gOODiDEA.NET</dc:creator>
		<pubDate>Wed, 07 Apr 2010 02:20:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-554</guid>
		<description>[...] Cassandra Internals &#8211; Writing - Cassandra Write Operation Performance Explained [...]</description>
		<content:encoded><![CDATA[<p>[...] Cassandra Internals &#8211; Writing &#8211; Cassandra Write Operation Performance Explained [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lenin Gali</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-518</link>
		<dc:creator>Lenin Gali</dc:creator>
		<pubDate>Sat, 20 Mar 2010 09:30:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-518</guid>
		<description>Mike, great article. I hope you can write one about reading from cassandra and how that works. I am curious about your EC2 experience, if you can share some details on 
1.what kind of performance are you getting, how many writes vs reads do you do per min?
2. have you used EBS or local disks and 
3. what would you recommend from your experience to those who would like to try out cassandra on EC2?
4. Some one suggested to use Zookeeper for adding locks on top Cassandra? what are pros and cons of doing that?</description>
		<content:encoded><![CDATA[<p>Mike, great article. I hope you can write one about reading from cassandra and how that works. I am curious about your EC2 experience, if you can share some details on<br />
1.what kind of performance are you getting, how many writes vs reads do you do per min?<br />
2. have you used EBS or local disks and<br />
3. what would you recommend from your experience to those who would like to try out cassandra on EC2?<br />
4. Some one suggested to use Zookeeper for adding locks on top Cassandra? what are pros and cons of doing that?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roger Schildmeijer</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-515</link>
		<dc:creator>Roger Schildmeijer</dc:creator>
		<pubDate>Fri, 19 Mar 2010 18:34:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-515</guid>
		<description>Nice A4 summary, good work.</description>
		<content:encoded><![CDATA[<p>Nice A4 summary, good work.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Perham</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-508</link>
		<dc:creator>Mike Perham</dc:creator>
		<pubDate>Wed, 17 Mar 2010 21:54:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-508</guid>
		<description>Chris, not every node has a copy of the data, just N nodes where typically N = 3.  The fact that the write will eventually propagate to those three nodes makes Cassandra eventually consistent.</description>
		<content:encoded><![CDATA[<p>Chris, not every node has a copy of the data, just N nodes where typically N = 3.  The fact that the write will eventually propagate to those three nodes makes Cassandra eventually consistent.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris R.</title>
		<link>http://www.mikeperham.com/2010/03/13/cassandra-internals-writing/comment-page-1/#comment-507</link>
		<dc:creator>Chris R.</dc:creator>
		<pubDate>Wed, 17 Mar 2010 21:01:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.mikeperham.com/?p=439#comment-507</guid>
		<description>So, if each write only writes to N nodes, how do the other nodes eventually get updated so that they&#039;re eventually consistent for reads?</description>
		<content:encoded><![CDATA[<p>So, if each write only writes to N nodes, how do the other nodes eventually get updated so that they&#8217;re eventually consistent for reads?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
