<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" media="screen" href="/styles/xslt/rss.xslt"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:c9="http://channel9.msdn.com">
<channel>
	<title>Comment Feed for Channel 9 - Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
	<atom:link rel="self" type="application/rss+xml" href="http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems/RSS"></atom:link>
	<image>
		<url>http://ecn.channel9.msdn.com/o9/previewImages/100/238840_100x75.jpg</url>
		<title>Channel 9 - Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<link></link>
	</image>
	<description>Emre Kiciman is a researcher in MSR focusing on solving hard problems&amp;nbsp;in the field of&amp;nbsp;automated systems management of large scale distributed systems. Basically, he works on algorithms and systems that can be used to understand how systems are working,
 independent of complexity, by analyzing low-level data captured at runtime.</description>
	<link></link>
	<language>en</language>
	<pubDate>Sat, 25 May 2013 03:27:57 GMT</pubDate>
	<lastBuildDate>Sat, 25 May 2013 03:27:57 GMT</lastBuildDate>
	<generator>Rev9</generator>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[OK, I got to 35 minutes in and I stopped watching.&nbsp; Iguess this coul have been a useful video on exactly what Emre is working on. Instead, it seemed to me to be a self-promotional video of allocentric concepts in relation to the 'big picture' of a large-scale
 system.&nbsp; I would have expected more than concept of a Standford phD doing something within Microsoft.. like &quot;okay, i learned these concepts and I want to introduce this really cool low-level implentation of xyz I'm working on for product x&quot;
<br /><br />I'm not trying to step on toes or anything, but this seems to be a waste of the bandwith of OrcasWeb, and not a very informative channel 9 piece at all, IMHO.<p>posted by Adenocard</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962106750000000</link>
		<pubDate>Thu, 12 Oct 2006 00:44:35 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962106750000000</guid>
		<dc:creator>Adenocard</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[Adenocard, <br /><br />Wow. Sorry you feel so strongly about this... Emre can't discuss the specifics of what he's working on for hopefully obvious reasons...
<br /><br />The point of the video was to discuss the problem space. In fact, we do discuss how the low level processing works. Further, we discuss the relationship between Machine Learning (pattern recognition) and systems management. What do you mean by self-promotion?
<br />C<p>posted by Charles</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962115360000000</link>
		<pubDate>Thu, 12 Oct 2006 00:58:56 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962115360000000</guid>
		<dc:creator>Charles</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[@ 00:01:40<br /><br />&nbsp;&nbsp;&nbsp; Charles, is there a video glitch a minute and fourty seconds into the interview after you asked about figuring the density of the Universe?<br /><br />@ 00:16:45<br /><br />&nbsp;&nbsp;&nbsp; This stuff is actually really interesting. So from what I'm gathering is that Emre is simply a wan-management-mastermind? Alot of the stuff he's discussing is actually very relevant, because since I joined my new job, I've been experiencing many new technologies
 which blow me away. Some of which are what he's talking about. The company I work for is a Medical-related Facility with offices littering the South-East Coast. I get automated emails from every one of the offices whenever their server detects any type of
 instrusion, or related warnings. This type of technology is awesome! Not to mention, our servers fix themselves most of the time and then email the &quot;Techies&quot; a summary of the activity.<br /><br />Excellent Interview, Charles. Maybe he didn't get as deep into a specific subject as some others would have preferred, but I am a sucker for tech-talk, no matter how broad the conversation is - kudos.<br /><p>posted by jsampsonPC</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962191740000000</link>
		<pubDate>Thu, 12 Oct 2006 03:06:14 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962191740000000</guid>
		<dc:creator>jsampsonPC</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[
<blockquote>
<div>jsampsonPC wrote:</div>
<div>&#65279;@ 00:01:40<br /><br />&nbsp;&nbsp;&nbsp; Charles, is there a video glitch a minute and fourty seconds into the interview after you asked about figuring the density of the Universe?<br /><br /></div>
</blockquote>
<br />No glitch. I edited out my inadvertant explanation of the meaning of everything. <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-1.gif' alt='Smiley' /> (Just culled some of my rambling. Nothing more.)<br />C<p>posted by Charles</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962237430000000</link>
		<pubDate>Thu, 12 Oct 2006 04:22:23 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962237430000000</guid>
		<dc:creator>Charles</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[I have to say I agree very much with what Adenocard said. Reason being that every day software designers are faced with very significant issues, and occasionaly a pattern comes forward from multiple inputs to help determine the &quot;right&quot; way to approach
 a problem. This is not such a case.<br /><br />This video was titled incorrectly. Previous material, and I'm sorry common practices within distributed design are farther along than this video. I think that is all I really need to say. <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-6.gif' alt='Sad' /><br /><p>posted by JParrish</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962260050000000</link>
		<pubDate>Thu, 12 Oct 2006 05:00:05 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962260050000000</guid>
		<dc:creator>JParrish</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[
<blockquote>
<div>JParrish wrote:</div>
<div>&#65279;I have to say I agree very much with what Adenocard said. Reason being that every day software designers are faced with very significant issues, and occasionaly a pattern comes forward from multiple inputs to help determine the &quot;right&quot; way to approach
 a problem. This is not such a case.<br /><br />This video was titled incorrectly. Previous material, and I'm sorry common practices within distributed design are farther along than this video. I think that is all I really need to say.
<img src="/emoticons/emotion-6.gif" border="0"><br /></div>
</blockquote>
<br /><br />What would a better title be in this case? This is a research video, not a product or how-to. I'm not certain I understand the problem here. Can you help me understand the issue(s)? Will help the next time around...<br /><br />Thanks for the feedback,<br />C<p>posted by Charles</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962266800000000</link>
		<pubDate>Thu, 12 Oct 2006 05:11:20 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962266800000000</guid>
		<dc:creator>Charles</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[Ok Charles, Im glad you asked! &nbsp; I will expound why I snickered a bit at this video. I went back and watched the whole thing again.
<br /><br />First, 45 minute talking head videos aren't exactly exciting to begin with, right?! I see a young guy who has an amazing amount of statistics and probability knowledge, now-drowning in data, with some algorithms and passion about machine-learning. Great!
<br /><br />But, how has he leveraged some things that we have seen at Redmond and elsewhere (in past channel 9 videos), like interpretative machine-learning of sales trends data, and stuff like the great new instrumentation and perf counters sliding downhill in Vista?
<br /><br />You began to question Emre more on the &quot;how's and why's&quot; 35 minutes or so in.. <br />First, let me say doing analysis on reliability with a large-scale distributed system without knowing it's functions or base patterns is near-suicidal. It is akin to trying to analyze water not knowing it's source.. things could be regular, irregular, irregularly
 regular, regularly irregular, and the whole gamet in-between (or maybe not even water!)<br /><br />Nowadays, code-(wh*re)s like me [I'm a code-writing senior-architect for a large company with offices in 63 countries..] are trying to &quot;bake-in&quot; the concept of better instrumentation, exceptional-condition handling, reliable messaging, increased trust and security,
 better profile integrations etc. into our distributed apps, using service-oriented architortures (haha) and implementing plethora's of polymorphic use-cases and workflows. (Whew!)&nbsp;
<br /><br />How does Erme's research in pattern analysis identify clusters at the machine-level of a distributed system statistically?&nbsp; How could data be effected for example (as Erme related to) with a distributed edge-network for content-delivery, or a distinct protocol
 sub-system (like some large-scale distributed source control networks?)&nbsp; <br /><br />How is this machine-pattern analysis affected by denial-of-service attacks to various parts of the system? What about time-zone and multicultural (read:internationalization) issues in the data pools?? What kind of control statistics can we compare patterns
 analysis to if we do not know normal patterns vs. abnormal...<br /><br />What kind of Microsoft technologies becoming available could assist in mitigating failures? (Ie: reliable messaging in WCF, profile services in asp.net for those checkouts, building loosely-granular interfaces, utilizing generics and strong-typing features
 with large data pools, What about CardSpace features for enterprise users, etc. etc. etc. I would be less worried abut the PR guys if you guys could find a way to connect more with us Microsoft-friendly developers using Microsoft technologies, and not just
 (as I said) some conceptual talk about pattern analysis and how much work a phd is at Stanford. (And I'm certain pricey too..
<i>so Emre, if you get down to Portland look me up and I'll buy the coffee.</i>. lol.. )<br /><br />I guess what I would like to see is a second round.. something where we expound on how this can bring value to what we are doing at the enterprise level.&nbsp; How pieces of the large-scale distributed system may return valuable trends data
<i>(such as Christmas season vs. football season shopping)</i> <br /><br />What kind of tactics, scoping, span-of-control and rules affect the base statistical means in determining problem X can be made obsolete to increase reliability.&nbsp;
<br />(As you pointed out, &quot;What about the OS?&quot;.. obviously, there are some inherit reverse-scalability issues in trying to solve statistical problems without large numbers, etc. )
<br /><br />The bottom line is, I thought &quot;This could have been so much more&quot;. Although I am glad I can even criticize it at all.&nbsp; It will just make things that much better here at Channel 9 as we all trudge ahead.
<br /><br />Thanks and have phun <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-1.gif' alt='Smiley' /> <br /><br />-Eric <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-11.gif' alt='Cool' /><p>posted by Adenocard</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962419570000000</link>
		<pubDate>Thu, 12 Oct 2006 09:25:57 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962419570000000</guid>
		<dc:creator>Adenocard</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[
<p>Don't [ever] put your hand over your mouth or nose.</p>
it's really bad body language <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-7.gif' alt='Perplexed' /><br /><br />btw:<br />I liked this video, and I think MSR videos are really important!<br /><p>posted by shoshan</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962463610000000</link>
		<pubDate>Thu, 12 Oct 2006 10:39:21 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962463610000000</guid>
		<dc:creator>shoshan</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[Adenocard,<br /><br />Thank you for the feedback. I appreciate it. I will ask some of your questions in future interviews.
<br /><br />We will continue to cover systems management work going on at Microsoft. There is some very exciting stuff going on that we are not ready to talk about publicly. When we are, you can bet that we will have deep dives into the technologies behind the management
 products here on C9. This video was primarily just a conversation with a researcher working on problems in the management/reliability space. We can't discuss the gory details of Emre's work in a public setting at this point. I hope you understand...<br /><br />Thanks again and keep posting!<br /><br />C<p>posted by Charles</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962641930000000</link>
		<pubDate>Thu, 12 Oct 2006 15:36:33 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962641930000000</guid>
		<dc:creator>Charles</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[
<p>[sorry for not hopping into this conversation earlier, I'm on paternity leave right now, and not checking email/web very frequently]</p>
<p>&nbsp;</p>
<p>Hi Adenocard,</p>
<p>Wow, thanks for taking the time to watch and make your detailed comments.&nbsp; I appreciate it.</p>
<p>First, I'll apologize:&nbsp; this is my first time talking to the channel 9 audience.&nbsp; I obviously misjudged the degree of technical detail that you're interested in seeing.<br /><br /><br />Secondly, analyzing the patterns of behavior in a system without fully understanding the system's architecture/topology is exactly the challenge!&nbsp; The issue is that with the large-scale systems that I'm looking at (primarily Internet services, not enterprise
 systems), the system changes frequently enough that any assumptions you make about base patterns and system characteristics might not be valid for long!&nbsp; And certainly, making assumptions about the characteristics of failures is a very bad idea!<br /><br />The overall goal&nbsp;is then to develop scalable techniques that help operators understand their system's behavior (both its correct and incorrect operation) by summarizing patterns/detecting anomalies/etc while making only minimal assumptions (not &quot;no assumptions&quot;)
 about the system and its behavior.&nbsp; On the machine learning side of things, this involves leaning heavily on non-parametric methods and robust statistics.&nbsp; On the systems side of things, the main question is choosing key instrumentation points and figuring
 out how to best respond to automated analyses (alerting operators, rebooting machines, etc) to achieve the higher-level goal of maintaining a reliable system.<br /><br />What makes this partially tractable is that right now, 1)&nbsp;with a large system,&nbsp;you really&nbsp;get to take&nbsp;advantage of statistical properties of&nbsp;seeing lots of data points; and 2) these techniques are not looking for anything very subtle --- people can see these
 patterns pretty easily.&nbsp; Unfortunately, if you're trying to monitor 10k&#43; streams of data, you can't find enough people to look at them all individually, and you certainly can't flood a single person with all the data to get a sense of a &quot;global&quot; view.<br /><br /><br />Feel free to email me if you want to chat more.<br /><br />---<br /><br />JParrish: Today,&nbsp;*people* definitely see and use these patterns all the time.&nbsp; Unfortunately, people don't scale well as systems grow even to 10k nodes (much less the&nbsp;larger Internet services!).&nbsp; There are some automated monitoring systems (e.g., I think zenprise
 is neat) out there today, but they're very specialized and have a lot of in-built knowledge about the systems they monitor.<br /><br />---<br /><br />Shoshan:&nbsp; Ugh, that's horrible!&nbsp; Did I actually do that?<br /><br />---<br /><br />Thanks again to all of you&nbsp;for taking the time to comment.<br /><br />Cheers,<br />Emre<br /><br /><br /><br /></p>
<p>posted by Emre Kiciman</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962809920000000</link>
		<pubDate>Thu, 12 Oct 2006 20:16:32 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632962809920000000</guid>
		<dc:creator>Emre Kiciman</dc:creator>
	</item>
	<item>
		<title>Re: Emre Kiciman: Reliable Computing for Large Scale Distributed Systems</title>
		<description>
			<![CDATA[First of all it was a very interesting video for me... And i loved that a turkish is speaking in C9, from his name and the way of saying his name i see that he is turkish like me as well <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-1.gif' alt='Smiley' /> I hope we will listen more about his research. Actually it is
 very hard topic for me but it gave some new ideas about such large scale systems. I am studying computer science enginnering in www.deu.edu.tr and hope that i ll be a professional like him.
<br /><br />Greetings Emre Abi, i hope we listen more from u...<img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-4.gif' alt='Tongue Out' /><br /><br />PS: About body language: He is putting his hand to his chest while listening, this means in turkish culture (i think also europeans do it as well) that &quot;i am listening u carefully&quot; ...<br /><br /><p>posted by xenonysf</p>]]>
		</description>
		<link>http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632964562080000000</link>
		<pubDate>Sat, 14 Oct 2006 20:56:48 GMT</pubDate>
		<guid isPermaLink="true">http://channel9.msdn.com/Blogs/Charles/Emre-Kiciman-Reliable-Computing-for-Large-Scale-Distributed-Systems#c632964562080000000</guid>
		<dc:creator>xenonysf</dc:creator>
	</item>
</channel>
</rss>