An Architecture for COnsistenct Nomadic Content Access
Motivation | Our Approach | Projects | People | Publications
Motivation: Future access to web-based content is likely to be dominated by two trends: (a) increasing amounts of dynamic, personalized content, and (b) a significant growth in ``on-the-move'' access using various mobile resource-constrained devices. These trends point to a situation where a user would have ubiquitous access to content, but require that content be efficiently delivered to the user irrespective of location, and in a form most suited to the user's end device. Unfortunately, classical caching and transcoding solutions do not work well together particularly in light of the above trends, necessitating a new caching architecture built from the ground-up to handle problems caused by dynamic content, transcoded versions of objects, and the nomadic nature of users.
Our approach: Based on the following two important observations: (1) despite the dynamically generated and personalized nature of web content, at the underlying level a relatively large amount of such content can in fact be shared, as shown in the following personalized yahoo pages, where s1, s2, s3, s4, and s5 are sharable, p1 and p2 are personalized objects.
(2) Though people are ''on the move'' to access information and services by mobile devices, most users exhibit a relatively static access pattern, often starting from a set of popular documents and following the links contained therein.
These two observations suggest our novel cache architecture named CONCA (COnsistent Nomadic Content Access), which attempts to support, from the ground up, caching of dynamic personalized content for (mobile) users by (1) reusing the shared portions of dynamic content; (2) exploiting knowledge of user content access preferences to efficiently support transcoding and nomadic access (e.g., by prefetching).(3) acting as an edge server to absorb some server load to the edge.
The logical organization of each CONCA node is show as follows, which consists two parts: shared and personalized. More details can be found at our overview paper.
Modeling Object Characteristics of Dynamic Web Content
By analyzing the content of six web sites that serve dynamic content over a two week period, we derive a set of models that characterizes this content in terms of a small number of independent parameters. Our studies find that the sizes and freshness times of component objects can be captured very well using Exponential and Weibull distributions respectively, and demonstrate siginificant content reusability across both the temporal and spatial dimensions.
Dynamic Content Emulator (DYCE)
Based on our recent work on modeling object characteristics of dynamic web content, and we have designed and implemented a Java-based dynamic content emulator (DYCE, pronounced as "dice"), which can serve the request for document templates and objects respectively (see DYCE paper) . DYCE is public available now. Currently, we are extending the function of DYCE.
Service Execution Environment for Edge Computing
We are also investigating the extension of CONCA node to support value-added services, and edge services. ICAP and/or SOAP are used to cooperate with Internet services, such as image filtering, language translation etc. A service execution container which will be integrated in CONCA node is under developing.
Workload Characterization of Personalized Web Site
Cooperating with NYUHome team, we recently analyzed the characteristics of NYUHome, which is a typical personalized web site, and studied the implications of these characteristics on dynamic web caching. A two-week period of NYUHome traces will be public available soon. If you are eager to play with the trace, please send email to firstname.lastname@example.org.
CONCA Prototype with ESI Support
Using edge side include (ESI) as the language to describe document template and part of personal assistants. A CONCA prototype is being implemented.
Dr. Weisong Shi
Vijay Karamcheti (NYU)
Eli Collins (NYU)
Vidula Pant (moved to CSFB)
Technical Reports and Publications
Vikrant Mastoli, Valmik Desai and Weisong Shi, ICAP vs. SOAP: Which One is Better for Edge Services?, February 2003, Technical Report CS-MIST-TR-2003-002, submitted.
Weisong Shi, Eli Collins and Vijay Karamcheti, Modeling Object Characteristics of Dynamic Web Content, to appear in Proceedings of the IEEE Globecom 2002 conference, Taipei, China, November 17-21, 2002. ( The full version is available as NYU computer science department technical report TR2001-822).
Weisong Shi, Randy Wright, Eli Collins and Vijay Karamcheti, Workload Characterization of a Personalized Web Site ---- and Its Implications for Dynamic Content Caching, to appear in Proceedings of the 7th International Conference on Web Content Caching and Distribution (WCW'02), Boulder, Colorado, August 14-16, 2002. (The full version is available as NYU computer science department technique report TR2002-829, April, 2002).
Service Execution Container for Edge Computing Platform, Master thesis by Vidula Pant, (Supervised by Dr. Weisong Shi and Prof. Vijay Karamcheti), May 2002.
Weisong Shi, Eli Collins and Vijay Karamcheti, DYCE: Model-based Emulation of Dynamic Web Content , to appear in Poster Proceedings of 11th International World Wide Web Conference (WWW 2002), Hawaii, May 7-11, 2002 (The full version of this paper is available here ).
CONCA: An Architecture for Consistent Nomadic Content Access, W. Shi and V. Karamcheti, in Workshop on Cache, Coherence, and Consistency (WC3'01), June 2001, Italy.